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Abstract 


This  report  constitutes  the  Final  Report  for  ONR  Contract  N0001478C0025.  It 
reviews  the  main  theoretical  and  empirical  developments  that  arose  from  this 
contract.  The  report  is  divided  into  three  main  sections.  The  first  briefly 
reviews  alternative  approaches  to  understanding  the  nature  of  intelligence. 

The  second  provides  the  proposed  componential  metatheory.  The  third  and  main 
section  describes  various  aspects  of  the  componential  theory  and  tests  of  this 
theory.  This  last  section  covers  inductive  and  deductive  reasoning.  The  report 
closes  with  some  conclusions  and  suggested  directions  for  future  research. 


Components  of  Individual  Differences 
in  Human  Intelligence 

During  the  twentieth  century,  a  great  diversity  of  approaches  has  emerged 
to  the  understanding  of  human  intelligence.  1  describe  here  one  of  these  ap¬ 
proaches,  which  1  refer  to  as  the  "componential  approach.”  My  presentation 
of  the  approach  is  divided  into  four  main  parts.  The  first  part,  an  introduc¬ 
tion,  sets  an  historical  and  contemporary  context  in  which  the  present  proposals 
may  be  viewed.  The  second  part  presents  a  componential  metatheory  or  framework, 
providing  a  description  of  the  basic  mechanisms  that  are  proposed  to  constitute 
the  human  intellectual  system.  The  third  part  provides  theory  derived  from 
the  componential  point  of  view,  and  data  collected  to  test  the  theory.  The 
fourth  part  draws  some  general  conclusions  from  the  metatheory,  theory,  and  data. 

In  recent  years,  two  of  the  most  influential  approaches  to  understanding 
Intelligence  have  been  the  psychometric  and  the  information-processing  ap¬ 
proaches.  My  own  approach,  and  that  of  many  other  contemporary  investigators, 
can  be  seen  as  a  synthesis  of  these  two  important  approaches.  (See,  for  example. 
Hunt,  1978,  for  another  approach  that  synthesizes  psychometrics  and  information 
processing  in  a  different  .»>.)  Each  of  the  psychometric  and  information¬ 
processing  approaches  has  sought  to  understand  the  nature  of  intelligence  in 
somewhat  different,  although  not  mutually  exclusive,  terms.  These  terms  are 


considered  below. 


Approaches  to  Understanding  the  Nature  of  Intelligence 

The  psychometric  approach  to  understanding  the  nature  of  Intelligence 
dominated  theory  and  research  in  the  first  half  of  the  century.  Investigators 
using  this  approach  sought  to  understand  Intelligence  by  analyzing  patterns 
of  individual  differences  in  scores  on  various  kinds  of  mental  tests,  such  as 
vocabulary,  number  series,  figural  analogies,  mental  rotation  of  geometric 
objects,  and  the  like.  The  basic  idea  was  that  underlying  scores  on  these 
manifest  measures  of  mental  abilities  were  one  or  more  latent  abilities  that 
gave  rise  to  the  observable  Individual  differences  in  test  scores.  If  these 
latent  sources  of  individual  differences  could  be  Identified,  then  the  structure 
of  human  intelligence  could  in  some  sense  be  understood.  A  statistical  proce¬ 
dure  called  "factor  analysis,"  which  analyzes  observable  patterns  of  individual 
differences  in  terms  of  hypothesized  latent  constructs  or  "factors,"  was  used 
to  make  the  hoped  for  identification. 

A  number  of  different  factorial  theories  of  human  intelligence  have  been 
proposed  that  differed  both  in  terms  of  the  numbers  and  the  identities  of  the 
factors  indicated.  At  the  lower  end  of  the  scale  of  factor  numerosity, 

Spearman  (1927)  proposed  that  intelligence  could  be  understood  in  terms  of  a 
general  factor  ("g")-- -which  he  tentatively  Identified  as  attributable  to 
Individual  differences  in  mental  energy— and  a  set  of  essentially  uninteresting 
and  clearly  subordinate  specific  factors,  each  of  which  was  relevant  only  to 
Individual  differences  in  the  performance  of  a  single  task.  Spearman's  theory 
thus  emphasized  just  a  single  factor  of  intelligence.  At  the  upper  end  of  the 
scale,  Guilford  (1967)  proposed  that  intelligence  could  be  understood  in  terms 
of  120  factors,  which  differed  in  terms  of  the  operations,  contents,  and  products 
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they  represented.  An  example  of  a  factor  In  Guilford's  system  would  be 
cognition  (process)  of  figural  (content)  relations  (product),  which  is  used 
to  Infer  the  nature  of  relationships  between  two  figural  terms,  as  in  a  figural 
analogy. 

Most  factorial  theories  of  intelligence  fall  between  the  extremes  of 
Spearman  and  Guilford  in  the  numbers  of  factors  they  propose.  Thurstone's 
(1938)  well-known  theory  of  primary  mental  abilities,  for  example,  posits 
seven  factors,  namely,  verbal  comprehension,  word  fluency,  number  facility, 
spatial  visualization,  reasoning,  perceptual  speed,  and  memory. 

Despite  its  initial  popularity  among  investigators  of  intelligence,  the 
psychometric  approach  to  understanding  intelligence  has  become  a  source  of 
increasing  disenchantment  during  the  latter  half  of  the  twentieth  century. 

The  full  range  of  reasons  for  this  disenchantment  are  too  numerous  and  complex 
to  consider  in  detail  here  (but  see  Carroll,  1978;  Sternberg,  1977b).  The 
main  reasons,  stated  briefly,  seem  to  have  been  (a)  difficulties  in  distin¬ 
guishing  among  and  in  empirically  discontinuing  alternative  factorial  theories, 
especially  through  the  use  of  factorial  methods,  (b)  the  almost  exclusive 
reliance  of  factorial  methods  upon  individual  differences  for  the  identifi¬ 
cation  of  the  constructs  constituting  intelligence,  with  the  concomitant 
assumption  that  a  structure  can  be  a  constituent  of  intelligence  only  if 
it  generates  observable  individual  differences  in  task  performance,  and  (c) 
the  failure  of  factorial  methods  directly  to  identify  the  processes  that 
combine  to  constitute  task  performance.  The  structural  model  of  factor 
analysis  seems  to  provide  a  useful,  but  incomplete  perspective  on  intellectual 
performance.  A  further  perspective  seems  to  be  needed  that  will  shed  light 
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on  process  as  veil  as  structure.  The  Information-processing  approach  seems 
to  provide  such  a  perspective. 

The  information-processing  approach  1:0  human  intelligence  seeks  to 
understand  Intelligence  in  terms  of  the  underlying  processes  that  in  various 
combinations  constitute  intelligent  task  performance.  Although  the  primary 
emphasis  of  the  approach  is  upon  process  identification,  the  approach  seeks 
also  to  identify  the  speeds  and  difficulties  with  which  these  processes  are 
executed,  the  strategies  for  task  performance  into  which  the  processes  combine, 
the  mental  representatives  upon  which  the  underlying  processes  and  strategies 
act,  and  the  allocation  of  attention  and  other  processing  resources  to  various 
aspects  of  a  given  task. 

Whereas  various  psychometric  theories  of  intelligence  are  usefully 
distinguished  in  terms  of  the  numbers  and  Identities  of  the  factors  they  pro¬ 
pose,  various  information-processing  theories  are  usefully  distinguished  in 
terms  of  what  might  be  referred  to  as  the  "level  of  processing"  to  which  they 
seek  to  ascribe  the  antecedents  of  Intelligent  performance,  with  levels  ranging 
from  the  perceptual-motor  level  to  the  level  of  complex  problem  solving.  Whereas 
no  sensible  theorist  would  seek  to  account  for  all  Intelligent  behavior  in  terms 
of  processes  operating  at  just  a  single  level  (or  narrow  range  of  levels,  since 

levels  are  best  conceived  of  as  continuous),  most  theorists  seem  to  emphasize 

/ 

a  single  level  or  a  fairly  narrow  adjacent  set  of  levels  in  their  theories. 
Although  most  theorists  emphasize  a  particular  set  of  levels,  they  generally 

also  attempt  to  deal  at  least  somewhat  with  a  broader  range  of  processing. 

At  the  extreme  of  simplicity  can  be  found  the  theory  of  Jensen  (1979).  Jensen 
has  sought  to  understand  intelligence  in  terms  of  very  simple  perceptual-motor 
information  processing,  as  is  found  in  simple  reaction  time  (time  to  offer  a 
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•ingle  response  to  a  single  stimulus)  and  choice  reaction  time  (time  to  offer 
one  of  several  possible  responses  to  one  of  several  possible  stimuli).  At  the 
extreme  of  complexity  can  be  found  theories  such  as  those  of  Anderson  (1976), 
Newell  and  Simon  (1972),  and  Schank  (1980),  which  have  sought  to  explain 
Intelligence  primarily  in  terms  of  complex  language  understanding  (e.g.,  sentence 
and  story  comprehension)  and  problem  solving  (e.g.,  logical  theorem-proving 
and  chess  performance).  At  levels  in  between  these  two  extremes  can  be  found 
theories  such  as  Hunt’s  (1978,  1980),  which  is  closer  to  the  ’’simpler"  end  of 
the  continuum  and  which  seeks  to  understand  intelligence  partly  in  terms  of 
speed  of  access  to  lexical  codes  stored  in  long-term  memory  (e.g.,  the  name 
of  the  letter  "A"),  and  theories  such  as  my  own  (Sternberg,  1979,  1980d,  1981b,  d) , 
which  is  closer  to  the  more  "complex"  end  of  the  continuum  and  which  seek  to 
understand  intelligence  primarily  in  terms  of  reasoning  and  verbal  comprehension 
(e.g.,  the  solution  of  analogies  and  the  figuring  out  of  the  meaning  of  a 
previously  unknown  word  encountered  in  a  natural  context,  such  as  a  newspaper 
article).  Having  placed  my  own  theory  in  the  context  of  some  others,  I  shall 
proceed  to  a  brief  description  of  the  metatheory  underlying  the  theory,  and 
then  shall  consider  jointly  the  theory  and  the  data  that  have  been  collected 
to  test  various  aspects  of  it. 


6 


Componential  Metatheory 

The  basic  unit  In  my  own  theory  Is  the  component  process.  Each  process 
or  "component,"  has  three  Important  properties  associated  with  It:  duration, 
difficulty  (l.e.,  probability  of  Its  execution  eventuating  in  an  erroneous 
result),  and  probability  of  execution.  The  three  properties  are,  at  least  in 
principle,  independent.  For  example,  a  given  component  may  take  a  rather  long 
time  to  execute,  but  may  be  rather  easy  to  execute,  in  the  sense  that  its 
execution  rarely  leads  to  an  erroneous  or  otherwise  inadequate  outcome;  or 
the  component  may  be  executed  quite  rapidly,  and  yet  be  rather  difficult  t<- 
execute,  in  the  sense  that  its  execution  often  leads  to  an  error.  Consiuei 
for  example,  "mapping,"  one  component  used  in  solving  analogies  such  as  LAI 
is  to  CLIENT  as  DOCTOR  is  to  (a)  PATIENT  or  (b)  MEDICINE.  Mapping  calls  for 
the  discovery  of  the  higher-order  relation  between  the  first  and  second  halves 
of  the  analogy.  The  component  has  a  certain  probability  of  being  executed  in 
solving  an  analogy,  and,  if  executed,  it  has  a  certain  duration  and  a  certain 
probability  of  being  executed  incorrectly. 
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Kinds  of  Components 

Components  can  be  classified  by  the  kind  of  function  they  perform.  1 
■hall  consider  each  function  in  turn. 

Metacomponents  are  higher-order  control  processes  used  for  executive 
planning  and  decision  making  in  task  performance.  Collectively,  they  are 
sometimes  referred  to  by  psychologists  as  the  "executive"  or  the  "homunculus." 
The  ten  metacomponents  that  I  believe  are  most  important  in  intelligent 
functioning  are  (a)  recognition  that  a  problem  of  some  kind  exists,  (b) 
recognition  cf  just  what  the  nature  of  the  problem  is,  (c)  selection  of  a 
set  of  lower-order  components  for  performing  a  task,  (d)  selection  of  a 
strategy  for  task  performance  into  which  to  combine  the  lower-order  components, 
(e)  selection  of  one  or  more  mental  representations  for  information,  (f) 
decision  as  to  how  to  allocate  attentional  resources,  (g)  monitoring,  or 
keeping  track  of  one's  place  in  task  performance,  and  of  what  has  been  done 
and  what  needs  to  be  done,  (h)  understanding  of  internal  and  external  feedback 
concerning  the  quality  of  task  performance,  (i)  knowing  how  to  act  upon  the 
feedback  that  is  received,  and  (j)  actually  acting  upon  the  feedback.  Note 
that  this  last  metacomponent  in  effect  assigns  a  crucial  role  to  action  in 
the  theory  of  intelligent  performance.  I  do  not  believe  that  one  can  have 
an  adequate  theory  of  Intelligence  without  considering  both  thought  and  the 
actions  that  emanate  from  it. 

Performance  components  are  lower-order  processes  used  in  the  execution 
of  various  strategies  for  task  performance.  Because  the  bulk  of  the  next 
section  of  this  article  will  deal  with  various  performance  components  and 
their  roles  in  task  performance,  only  one  example  of  a  performance  component 
will  be  given  here  (to  supplement  the  example  of  "Mapping,"  a  performance 
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component  briefly  described  earlier).  The  example  is  that  of  inference, 
the  component  by  which  an  individual  figures  out  similarities  and  differences 
between  or  among  a  set  of  objects.  (In  contrast,  mapping  involves  figuring 
out  higher-order  relations  between  or  among  lower-order  relations.) 

Acquisition  components  are  processes  involved  in  learning  new  information 
and  storing  it  in  memory.  An  example  of  such  a  component  is  rehearsal  of  words 
in  a  to-be-learned  list.  Retrieval  components  are  processes  involved  in 
accessing  information  that  has  already  been  acquired  and  stored  in  memory, 
for  example,  accessing  a  lexical  code.  Transfer  components  are  processes 
involved  in  generalizing  stored  and  retrieved  information  from  one  situation 
to  another,  for  example,  recognizing  that  two  different  terms  mean  the  same 
thing.  As  of  yet,  the  processes  of  transfer  are  but  poorly  understood. 

The  various  kinds  of  components  considered  above  are  applied  in  task 
performance  toward  the  reaching  of  a  solution  or  other  goal.  Components 
can  vary  greatly  in  the  range  of  tasks  to  which  they  apply.  Some  components, 
and  especially  the  metacomponents,  appear  to  be  broadly  applicable  over  a 
wide  range  of  tasks.  Other  components  apply  to  only  a  narrow  range  of  tasks; 
such  components  are  of  little  theoretical  interest,  and  generally  of  little 
practical  interest  as  well.  My  concern  here  will  be  only  with  components 
that  I  believe  are  fairly  generalizable  over  a  vide  range  of  tasks. 
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Interactions  among  Components 

The  various  kinds  of  components  described  above  are  theorized  to  be 
highly  Interactive.  Four  kinds  of  interactions  need  to  be  considered: 
direct  activation  of  one  kind  of  component  by  another  kind;  indirect 
activation  of  one  kind  of  component  by  another  kind  via  the  mediation  of 
a  third  kind  of  component;  direct  feedback  from  one  kind  of  component  to 
another  kind;  and  indirect  feedback  from  one  kind  of  component  to  another 
kind  via  a  third  kind.  In  the  proposed  system,  only  metacomponents  can 
directly  activate  and  receive  feedback  from  each  other  kind  of  component. 

Thus,  all  control  passes  directly  from  the  metacomponents  to  the  system, 
and  all  information  passes  directly  from  the  system  to  the  metacomponents. 

The  other  kinds  of  components  can  activate  each  other  only  indirectly,  and 
receive  feedback  from  each  other  only  indirectly;  in  every  case,  mediation 

i 

must  be  supplied  by  the  metacomponents.  For  example,  the  acquisition  of 
information  affects  the  retrieval  of  information  and  the  various  kinds  of 
performances  that  can  be  done  upon  that  information,  but  only  via  the  link 
of  the  three  kinds  of  components  to  the  metacomponents.  Information  from 
the  acquisition  components  is  filtered  to  the  other  kinds  of  components 
through  the  metacomponents.  Metacomponents  are  also  unique  among  the 
various  kinds  of  components  in  that  they  can  directly  activate  and  receive 
feedback  from  each  other. 

Consider  a  simplified  example  of  hov  the  proposed  system  might  function 
in  the  solution  of  a  word  puzzle,  such  as  an  anagram  (where  the  letters  of 
the  word  are  presented  in  scrambled  fashion).  As  soon  as  one  decides 
metacomponentially  upon  a  certain  tentative  strategy  for  unscrambling  the 
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letters  of  the  word,  activation  of  that  strategy  can  pass  directly  from  the 
metacomponent  responsible  for  deciding  upon  a  strategy  to  the  performance 
component  responsible  for  executing  the  first  step  of  the  strategy,  and 
subsequently,  activation  can  pass  to  the  successive  performance  components 
needed  to  execute  the  strategy.  Feedback  will  return  from  the  performance 
components  indicating  how  successful  the  strategy  is  turning  out  to  be.  The 
Individual  must  decide  how  to  act  upon  this  feedback,  and  then  must  actually 
perform  the  required  actions. 

As  a  given  strategy  lr  being  executed,  new  information  is  being  acquired 
about  how  to  solve  anagrams,  in  general.  This  information  is  also  fed  back 
to  the  metacomponents,  which  may  act  upon  or  ignore  this  information.  New 
information  that  seems  useful  is  more  likely  to  be  directed  back  from  the 
relevant  metacomponents  to  the  relevant  transfer  components  for  use  in  later 
anagram  (or  other)  problems,  as  needed.  Thus,  failure  to  transfer  information 
may  be  due  to  Inadequate  operation  of  transfer  components,  but  alternatively, 
it  may  be  due  to  Inadequate  metacomponential  activation.  Inadequate  feedback 
to  metacomponents  from  performance  components,  or  inadequate  functioning  of 
acquisition  components  that  are  responsible  for  learning  the  information  in 
the  first  place.  Such  is  the  nature  of  an  interactive  system. 

The  metacomponents  are  able  to  process  only  a  limited  amount  of  information 
at  a  given  time.  In  a  difficult  task,  and  especially  a  new  and  different  one, 
the  amount  of  information  being  fed  back  to  the  roetacomponents  may  exceed 
capacity  to  act  upon  that  information.  In  this  case,  the  metacomponents 
become  overloaded,  and  valuable  information  that  cannot  be  processed  may  simply 
be  wasted.  The  total  information-handling  capacity  of  the  metacomponents  of 


11 


a  given  system  will  thus  be  an  important  limiting  aspect  of  the  system.  This 
capacity  can  effectively  be  increased  by  automatization  of  component  execution. 
Automatic  processing  of  information  is  theorized  to  require  far  less  in  the 
way  of  attentlonal  resources  than  Is  required  by  controlled  processing  (see 
Sternberg,  1931a  ). 

To  summarize,  it  has  been  proposed  that  human  intelligence  can  be 
understood  in  terms  of  the  kinds  of  components  constituting  its  functioning, 
and  in  terms  of  various  kinds  of  interactions  among  those  kinds  of  components. 
In  the  next  section,  it  is  shown  how  the  kind  of  componential  viewpoint 
presented  here  can  be  applied  concretely  to  the  understanding  of  intelligent 


task  performance. 
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Componential  Theory  and  Empirical  Tests 

Over  Che  past  several  years,  my  collaborators  and  1  have  been  engaged  In 
developing  a  componentlal  theory  of  human  Intelligence  that  uses  as  its 
conceptual  foundation  the  metatheory  described  above.  The  goal  has  been  to 
provide  an  integrated,  and  in  some  respects,  unified  account  of  human  intellec¬ 
tual  functioning.  Although  ve  have  studied  a  number  of  different  tasks 
requiring  a  number  of  different  components  of  information  processing,  perfor¬ 
mances  on  the  tasks  are  theorized  minimally  to  have  in  conraon  (a)  the  kinds  of 
components  (metacomponents,  performance  components,  acquisition  components, 
retrieval  components,  transfer  components)  relevant  to  task  performance,  with 
their  attendant  properties  of  duration,  difficulty,  and  probability  of 
execution,  (b)  the  particular  (ten)  metacomponents  noted  earlier,  which  are 
applicable  to  functioning  in  virtually  any  cognitive  task,  and  (c)  the  scheme 
of  interaction  among  the  various  kinds  of  components,  also  as  noted  earlier. 
Hence,  the  metacomponential  framework  provides  one  unifying  set  of  constructs 
for  understanding  what  is  common  across  essentially  the  full  range  of  intellec¬ 
tual  tasks. 

What  makes  intellectual  tasks  more  or  less  similar  to  each  other  in  their 
Information  processing  requirements  is  the  degree  of  overlap  in  the  performance 
components  utilized  in  task  execution.  Hence,  the  particular  theories  of  task 
performance  to  be  described  provide  a  basis  for  understanding  why  performances 
on  certain  kinds  of  tasks  can  be  viewed  as  more  or  less  Interrelated,  both 
from  the  point  of  view  of  shared  information  processes  and  the  point  of  view 
of  correlated  patterns  of  individual  differences.  On  this  view,  the  psycho¬ 
metric  factors  representing  common  sources  of  individual  differences  across 
tasks  are  derivatives,  in  some  sense,  of  shared  information  processes  across 
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these  tasks.  What,  exactly,  are  these  information  processes,  and  how  are  they 
shared  across  tasks?  It  is  to  a  consideration  of  these  Issues  that  I  will 
shortly  turn. 

It  is  useful,  for  our  present  purposes,  to  divide  our  consideration  of 
Intellectual  abilities  into  three  broad  classes:  fluid  abilities,  crystallized 
abilities,  and  social/practical  abilities.  Fluid  abilities  are  skills  involved 
in  the  solution  of  problems  such  as  figural  analogies,  number  series,  and 
verbal  classifications  of  related  terms.  Crystallized  abilities  are  the  skills 
and  knowledge  drawn  upon  in  the  performance  of  tests  of  things  such  as  vocabu¬ 
lary,  reading  comprehension,  and  general  information.  Social/practical 
abilities  are  skills  used  in  the  everyday  socio-cultural  problems  one  faces  in 
one's  life.  There  are  three  interrelated  reasons  why  this  particular  classi- 
ficatory  scheme  makes  sense.  First,  at  least  the  first  two  aspects  of  it,  and 
to  a  lesser  extent,  the  third,  seem  to  correspond  well  to  the  broad  "group 
factors"  (factors  spanning  wide  groups  of  tests,  but  not  the  entire  range  of 
tests)  that  have  emerged  in  psychometric  research  (e.g.,  Cattell,  1971;  Horn, 
1968;  Vernon,  1971).  Second,  our  research  and  that  of  others  suggests  that 
this  particular  scheme  comes  close  to  maximizing  the  similarity  of  the 
performance  components  used  in  tasks  measuring  any  one  of  these  three 
constellations  of  abilities  while  maximizing  the  dissimilarity  of  the 
performance  components  used  across  tasks  from  different  ones  of  the  three 
constellations  of  abilities  (e.g.,  Pellegrino  &  Glaser,  1980;  Snow,  1978; 
Sternberg,  1980d)>  And  third,  it  so  happens  that  if  one  assesses 

people's  implicit  theories  (internalized  conceptions)  of  the  nature  of 
Intelligence,  these  three  constellations  happen  to  characterize  the  implicit 
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theories  of  both  laypersons  and  experts  in  the  field  of  intelligence  (Sternberg, 
Conway,  Ketron,  &  Bernstein,  1981).  Consider  now  what  some  of  the  performance 
components  are  that  constitute  abilities  in  the  first  of  these  domains,  fluid 
abilities.  This  is  the  domain  to  which  I  and  many  other  theorists  of  intelli¬ 
gence  have  so  far  devoted  the  most  attention.  For  organizational  convenience 
and  further  reasons  of  psychological  theory,  my  discussion  of  fluid  abilities 
will  be  subdivided  into  separate  sections  on  induction  and  deduction. 


i 


15 


Fluid  Abilities :  Induction 

Our  investigations  of  fluid  abilities  of  inductive  reasoning  have  led  us 

to  the  analysis  of  a  number  of  tasks,  including  analogies  (Sternberg,  1977a, 

1977b;  Sternberg  &  Nigro,  1980;  Sternberg  &  Rifkin,  1979);  series  completions 

(Sternberg  &  Gardner,  1982,  1983);  classifications  (Sternberg  &  Gardner, 

198^7  1983  ) ;  metaphorical  understanding  and  appreciation  (Sternberg, 

Tourangeau,  &  Nigro,  1979;  Tourangeau  &  Sternberg,  1981,  1982};  causal 

inferences  (Schustack  &  Sternberg,  1981);  and  projection  of  future  events 
1982) . 

(Sternberg,  1981b,  .  For  the  purposes  of  this  summary  overview,  I  will  discuss 
each  of  these  tasks  briefly,  describing  first  the  theory  underlying  it  and 
then  the  data  that  have  been  collected  to  test  the  theory. 

Theory  of  analogical  reasoning  processes.  My  first  investigations  of  fluid 
abilities— indeed,  of  any  mental  abilities — were  of  analogical  reasoning 
abilities.  In  its  present  form,  my  theory  states  that  analogical  reasoning 
can  be  decomposed  into  seven  underlying  performance  components,  which  I  will 
illustrate  by  drawing  upon  the  simple  analogy,  LAWYER  is  to  CLIENT  as  DOCTOR 
is  to  (a)  MEDICINE,  (b)  SICK  PERSON.  The  seven  components  are  (a)  encoding, 
by  which  the  individual  recognizes  the  terms  of  the  problem  and  accesses 
attributes  of  the  analogy  terms  that  are  stored  in  semantic  memory  and  that 
might  be  relevant  to  task  solution,  (b)  inference,  by  which  the  individual 
figures  out  the  relationship  between  the  first  two  terms  of  the  analogy  (e.g., 
that  a  lawyer  renders  professional  consulting  services  to  a  client),  (c) 
mapping .  by  which  the  individual  figures  out  the  higher-order  relation  between 
the  two  halves  of  the  analogy  (e.g.,  that  both  a  lawyer  and  a  doctor  render 
professional  services) ,  (d)  application,  by  which  the  individual  takes  the 
relationship  inferred  between  the  first  two  terms  in  the  first  half  of  the 
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analogy  as  mapped  to  the  third  term  in  the  6econd  half  of  the  analogy  and  uses 
this  relationship  to  generate  an  "ideal"  completion  to  the  analogy  (e.g.,  tie 
individual  might  generate  PATIENT  as  an  ideal  completion),  (e)  comparison,  by 
which  the  Individual  compares  each  of  the  given  answer  options  (in  multiple- 
choice  analogies)  to  the  ideal,  and  decides  which  is  better  (in  the  sense  of 
more  closely  resembling  the  ideal)  e.g.,  the  Individual  will  compare  each  of 
MEDICINE  and  SICK  PERSON  to  PATIENT,  (f)  justification,  by  which  the  individual 
decides  whether  the  preferred  answer  option  is  close  enough  to  the  ideal  option 
to  warrant  its  selection,  or  whether  instead  the  possibility  ought  to  be 
entertained  and  possibly  acted  upon  that  an  error  has  been  made  in  earlier 
information  processing  (e.g.,  the  individual  might  decide  that  SICK  PERSON, 
although  not  an  ideal  response,  is  at  or  above  some  criterion  for  a  minimally 
acceptable  response),  and  (g)  response,  by  which  the  individual  communicates 
his  or  her  choice  of  an  answer  (e.g.,  the  individual  might  circle  or  press  a 
button  indicating  his  or  her  choice  of  SICK  PERSON  as  the  preferred  answer). 

I  initially  tested  this  theory  of  information  processing  (minus  the 
comparison  component,  which  had  not  yet  become  part  of  the  theory  at  the  time) 
under  the  assumptions  that  (a)  the  components  were  combined  sequentially  (serially) 
and  that  (b)  they  acted  upon  an  attribute-value  mental  representation  for 
information,  meaning  that  each  analogy  term  could  be  decomposed  into  a  set  of 
attributes  with  some  range  of  possible  values  on  each  attribute,  for  example, 
height:  tall-short.  Alternative  models  were  tested  that  varied  in  terms  of 
which  components  of  the  theory  were  exhaustive  (all  encoded  attributes 
processed,  and  which  seif-terminating  (only  a  proper  subset  of  the  encoded 

attributes  processed),  and  alternative  theories  were  also  tested  that 
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differed  in  the  numbexs  of  components  theorized  to  be  needed  for  analogy 
solutions. 

Subjects  in  the  first  set  of  experiments  were  Stanford  undergraduates. 

One  group  of  16  subjects  solved  1152  schematic-picture  analogies  and  68 
verbal  analogies;  a  second  group  of  24  subjects  solved  90  geometric  analogies. 
An  example  of  each  of  the  three  kinds  of  items  is  shown  in  Figure  2 .  Note  that 


Insert  Figure  2  about  here 


the  schematic-picture  and  verbal  analogies  were  presented  in  a  true-false 
format,  and  the  geometric  analogies  were  presented  in  a  forced-choice  format. 

The  primary  dependent  variable  was  response  time  (with  error  rate  serving  as 
a  secondary  dependent  variable)  and  the  independent  variables  were  manipulations 
of  various  aspects  of  item  difficulty  that  were  needed  in  order  to  separate 
parameters  representing  durations  of  the  theorized  component  processes. 

Subjects  were  tested  tachistoscopically,  meaning  that  they  sat  in  front  of 
a  large  box-like  contraption  that  presented  stimuli,  measured  response  time, 
and  recorded  response  choices.  Tachistoscopic  testing  was  followed  by  paper- 
and-pencll  testing  on  psychometric  measures  of  reasoning  and  perceptual-motor 
•peed  abilities. 

Theory  testing  was  accomplished  via  mathematical  modeling.  Each  alter¬ 
native  theory  and  model  was  initially  expressed  as  an  information-processing 
flow-chart  representing  each  performance  compere as  a  box  in  the  chart. 

These  models  were  then  quantified  by  assigning  a  mathematical  parameter  to 
represent  the  duration  (or  difficulty)  of  a  given  processing  component. 


Automobile  :  Road  : :  Train  :  Caboose 


X  X  0  K3  ca 


Figure  2.  Examples  of  schematic-picture,  verbal,  and  geometric  analogies. 
The  first  two  analogy  types  were  true-false;  the  last  was  forced-choice.  Subjects 
were  told  either  to  determine  whether  the  analogy  was  true  or  false,  or  to  determine 
which  of  two  answer  options  better  completed  the  analogy.  Subjects  solving  schematic- 
picture  analogies  were  told  that  found  binary  attributes— height  (tall,  short),  weight 
(fat,  thin),  sex  (male,  female),  and  clothing  color  (blue,  red,  in  the  actual  stimuli) 
were  relevant  to  item  solution. 
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Multiple  linear  regression  was  used  to  predict  the  mean  response  tine  to  each  j 

analogy  item  type  from  the  appropriate  set  of  independent  variables  representing  j 

the  number  of  times  a  given  component  had  to  be  performed  in  analogy  solution.  j 

This  model-testing  procedure  was  possible  because  items  had  been  systematically 
constructed  so  as  to  vary  the  sources  of  difficulty  involved  in  their  solution. 

Mean  response  times  were  1.42  seconds  for  schematic-picture  analogies, 

2.42  seconds  for  verbal  analogies,  and  6.58  seconds  for  geometric  analogies. 

Error  rates  were  very  low  (ranging  from  1-5%  across  tasks).  The  proposed 

theory  provided  the  best  fit  to  the  latency  data,  accounting  for  .92,  .86,  and 

.80  of  the  variance  in  the  schematic  picture,  verbal,  and  geometric  group-mean 

2 

latency  data.  (These  proportions  are  the  squared  correlations  £_R_j  between 

predicted  and  observed  data  points  for  each  of  the  analogy  item  types  in  each 

of  the  three  data  sets.)  Root-mean-square  deviations  (RMSD's)  between  predicted 

and  observed  data  points  (measuring  absolute  badness  of  fit)  were  .13,  .26,  and  | 

1.68  seconds  for  the  schematic-picture,  verbal,  and  geometric  analogies 
respectively.  The  preferred  model  under  the  proposed  theory  was  one  in  which 
the  inference  component  was  exhaustive  (meaning  that  all  encoded  attributes 
were  subjected  to  the  Inference  process),  but  in  which  mapping  and  application 
were  self-terminating  (meaning  that  only  a  portion  of  the  encoded  attributes 
were  subjected  to  the  mapping  and  application  components).  It  is  worth  noting 
that  although  the  proposed  theory  did  quite  a  reasonable  job  of  accounting  for 
variance  In  the  latancy  data,  the  unexplained  variance  was  statistically 
significant,  meaning  that  the  proposed  theory  is  not  equivalent  to  the  "true 
theory."  There  is  systematic  variation  that  still  remains  to  be  explained. 

Patterns  of  correlations  between  the  global  (average  response  tine  over 
items  for  a  given  subject)  and  parameter  (response  latency  for  a  particular 
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processing  component)  scores  of  the  analogies  tasks,  on  the  one  hand,  and  the 
psychometric  tests,  on  the  other,  were  mixed.  The  perceptual-motor  psychometric 
tests  had  been  included  for  purposes  of  discriminant  validation,  meaning  that 
1  had  hoped  that  individuals*  task  scores  would  not  be  correlated  with  them. 
Significant  correlations  of  this  kind  would  suggest  that  the  tachlstoscoplcally 
administered  analogies  test  was  measuring  perceptual-motor  speed,  a  construct 
that  was  not  of  interest  in  the  present  studies.  Happily,  neither  global  task 
scores  nor  parameter  scores  showed  any  significant  correlations  with  the  percep¬ 
tual-speed  measures.  But  the  psychometric  reasoning  tests  were  included  for 
purposes  of  convergent  validation,  meaning  that  I  had  hoped  that  global  task 
and  appropriate  parameter  scores  would  be  correlated  with  the  reasoning  measures. 
Although  the  correlations  between  the  global  task  and  test  scores  were  adequate 
(ranging  from  about  -.4  to  -.6  across  the  three  tasks,  with  negative  correla¬ 
tions  indicating  that  shorter  task  latencies  were  associated  with  greater 
numbers  correct  on  the  psychometric  tests),  the  localization  of  the  loci 
responsible  for  these  correlations  were  disappointing.  Although  there  were 
some  significant  correlations  of  reasoning-type  parameters  (inference,  mapping, 
application,  justification)  with  the  test  scores,  most  of  the  global  correlation 
turned  out  to  be  localized  in  the  correlation  of  the  response  parameter  with 
the  reasoning  score!  Although  I  proposed  several  possible  explanations  for 
this  surprising  phenomenon  at  the  time  (Sternberg,  1977b),  it  remained  for 
later  research  to  show  that  the  result  was  due  to  several  factors,  namely, 
unreliability  of  the  reasoning  component  scores  in  contrast  to  the  response 

component  score  vwmen  depressed  t'ne  possiDie  correlations  one  could  ootain  witn 

the  former  type  of  score),  and  (b)  confounding  of  metacomponential  latencies 


(which  would  be  expected  to  correlate  with  psychometrically  measured  reasoning) 
with  the  response-component  latency. 

Why  did  this  confounding  occur?  The  response  component  score  was  estimated 
as  a  regression  constant  in  the  mathematical  model,  meaning  that  it  was  estimated 
as  that  portion  of  response  time  that  was  corstant  across  item  types  within  a 
given  set  (schematic-picture,  verbal,  or  geometric)  of  analogies.  The  use  of 
this  (fairly  standard)  procedure  for  estimating  response  component  time  had 
the  unfortunate  result  that  any  component  latency  that  was  constant  across  iter, 
types  was  confounded  with  the  response  constant.  In  subsequent  studies,  my 
colleagues  and  I  attempted  to  obtain  more  reliable  reasoning  component  scores 
and  to  increase  the  range  of  item  difficulty  and  variety  in  order  better  to 
isolate  the  response  component  from  any  other  component. 
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Extension  of  theory  to  series  completions  and  classifications.  Tne 
modeling  data  described  above  provided  good  support  for  the  proposed  theory 
of  analogical  reasoning  processes.  Eut  the  correlational  results  were  perplexing. 
The  data  set  as  a  whole  raised  two  Important  theoretical  questions. 

First,  could  the  theory  of  analogical  reasoning  processes  be  extended  to 
other  kinds  of  inductions  so  as  to  demonstrate  some  generality  of  the  theory’s 
ability  to  account  for  information  processing  during  inductive  reasoning,  in 
general,  as  well  as  during  analogical  reasoning,  in  particular.  The  theory  would 
be  of  considerably  greater  psychological  interest  if  its  components  could  provide 
an  account  of  more  than  just  one  kind  of  induction,  and  to  the  extent  that  the 
historical  claim  is  correct  that  reasoning  by  analogy  provides  a  paradigm  for 
inductive  thought  (Reitman,  1965;  Spearman,  1923),  the  theory  should  generalize. 

Second,  the  pattern  of  correlation  between  parameters  of  inductive  reasoning 
tasks  and  scores  on  psychometric  tests  needed  to  be  clarified.  If  the  proposed 
theory  of  reasoning  were  correct,  then  with  sufficiently  reliable  task,  parameter 
estimates  and  psychometric  test  scores,  and  with  sufficient  variation  in  item 
types,  parameters  representing  durations  of  reasoning  components — inference, 
mapping,  application,  comparison,  justification — should  show  statistically 
reliable  correlations  with  psychometrically-measured  reasoning  ability,  and  the 
response  parameter  should  not,  or  at  best  show  a  weak  correlation. 

Twenty-four  Yale  undergraduates  were  confronted  with  three  tasks — analogies, 
aeries  completions,  and  classifications— presented  via  three  kinds  of  contents — 
schematic  pictures,  words,  and  geometric  forms  (Sternberg  &  Gardner,  1982, 

1983),  An  example  of  each  of  the  nine  kinds  of  items  used  (3  tasks  x  3 
contents)  is  shown  in  Figure  3.  A  total  of  2880  items  was  administered  to  each 
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Insert  Figure  3  about  here 


subject.  This  large  number  of  items  was  Intended  to  Insure  highly  reliable 
global-task  scores  and  Individual  parameter  estimates.  In  addition,  subjects 
in  the  experiment  received  two  forms  of  each  of  six  psychometric  tests  of 
reasoning  abilities  and  three  of  perceptual-speed  abilities,  again  to  insure 
reliability  of  scores.  Total  testing  time  was  about  25  hours  per  subject, 
spread  out  over  a  number  of  separate  sessions.  As  in  the  earlier  analogy 
experiments,  the  primary  dependent  variable  was  response  time  (with  error 
rates  as  a  secondary  dependent  variable),  and  independent  variables  were  formed 
by  manipulations  of  various  aspects  of  item  difficulty.  Also  as  in  the  earlier 
experiments,  task  items  were  presented  tachistoscopically ,  with  subjects  timed 
in  their  latency  of  response  to  each  item. 

As  in  the  analogies  research,  it  was  assumed  that  performance  components 
were  executed  sequentially,  and  that  these  components  acted  upon  an  attribute- 
value  representation  for  information.  Because  the  comparison  component  was  being 
newly  added  to  the  theory,  the  ability  of  the  theory  to  account  for  the  latency 
data  with  the  additional  component  was  compared  to  the  ability  of  the  theory  to 
account  for  the  data  without  this  additional  component.  The  question  of  interest 
was  whether  the  additional  component  latency — as  represented  by  an  additional 
parameter  in  the  mathematical  model,  accounted  for  statistically  significant 
and  practically  substantial  additional  amounts  of  variance  in  the  data.  The 
answer  to  this  question  proved  to  be  affirmative,  and  the  data  presented  here 
are  for  the  full  set  of  parameters  used  to  model  the  full  set  of  data  points 
(with  each  group-mean  iter,  latency  serving  as  a  data  point)  for  the  nine  tasks. 
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Figure  3.  Examples  of  schematic-picture,  verbal,  and  geometric  analogies, 
series  completions,  and  classifications.  All  items  were  forced  choice.  In  the 
analogies,  subjects  had  to  choose  the  final  term  that  was  related  to  the  third 
term  in  the  same  way  that  the  second  term  was  related  to  the  first  term.  In  the 
series  completions,  subjects  had  to  choose  the  final  term  that  completed  the 
series  pattern  set  by  the  first  three  terms  as  projected  to  the  fourth  term. 

In  the  classification,  subjects  had  to  decide  in  which  of  two  categories  (each 
represented  by  two  terms)  a  target  stimulus  better  fit.  Subjects  solving  schematic- 
picture  items  were  told  that  four  binary  attributes— hat  color  (white,  black), 
vest  pattern  (striped,  polka-dotted),  handgear  (umbrella,  briefcase),  and  footwear 
(boots,  shoes) — were  relevant  to  item  solution. 
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Tree  :  Forest  ::  Soldier  :  (a)  General  (b)  Array 


Second  : 

Minute  :  Hour  : : 

Decade 

• 

(a) 

Time  (b) 

Century 

Rarely  : 

Sometimes  :  Often 

;  i 

Many  : 

(a) 

Frequently 

(b)  Most 

Baby  carriage  :  Tricycle  :  Bicycle  :: 

Measles  :  (a)  Illness  (b)  Acne 


(a)  Dictionary  Encyclopedia  (b)  Lemonade  Rum 

Gasoline 

(a)  Furnace  Stove  (b)  Refrigerator  Air  conditioner 

Oven 


(a)  Germany  France 


(b)  Vietnam  Korea 

Italy 


I 
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Again,  mathematical  modeling  was  accomplished  by  linear  multiple  regression. 

Mean  response  times  across  the  nine  tasks  ranged  from  2.92  to  5.87  seconds, 

with  a  median  of  4.50  seconds.  There  was  only  a  weak  effect  of  reasoning  task, 

but  in  terms  of  content,  geometric  items  were  clearly  more  difficult  than  the 

other  two  kinds  of  items.  Error  rates  ranged  from  .002  to  .055  with  a  median 

of  .028  across  the  nine  tasks. 

2 

Values  of  R  (proportion  of  variance  in  the  group-mean  latency  data  accounted 

for)  ranged  over  the  nine  tasks  from  .49  to  .94  with  a  median  of  .67.  The  major 

2 

variable  affecting  R  was  content  rather  than  reasoning  task.  In  particular, 

2  2 
values  of  R  were  lower  for  geometric  items.  Median  values  of  R  were  .76  for 

schematic-picture  items,  .67  for  verbal  items,  and  .58  for  geometric  items.  As 
was  the  case  in  the  previous  experiments,  the  unaccounted  for  variance  was 
statistically  significant.  Values  of  RMSD  ranged  across  the  nine  tasks  from 
.18  second  to  1.90  seconds,  with  a  median  of  .71  second.  We  were  particularly 
interested  in  these  experiments  in  examining  patterns  of  correlations  between 
experimental  task  and  psychometric  test  scores.  Correlations  for  global  task 
scores  with  averaged  reasoning  and  perceptual-motor  speed  scores  were  promising. 
Correlations  with  the  reasoning  tests  ranged  from  -.47  to  -.72  with  a  median  of 
-.64;  correlations  with  the  perceptual-motor  speed  tests,  however,  ranged  only 
from  .16  down  to  -.13,  with  a  median  of  .00.  The  overall  correlations  thus 
demonstrated  both  convergent  validity  (the  task  scoies  correlated  with  reasoning 
test  scores,  as  predicted)  and  discriminant  validity  (the  task  scores  did  not 
correlate  with  perceptual-motor  speed  test  scores,  as  predicted).  The  correla¬ 
tional  patterns  for  the  component  scores  were  also  promising.  A  combined 
inference-rapping-appli cation  parameter(with  combination  having  been 
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computed  to  increase  reliability  of  the  estimate)  correlated  with  the  combined 
psychometric  reasoning  score  -.70  for  analogies,  -.50  for  aeries  completions, 
and  -.64  for  classifications  (in  each  case,  collapsed  over  the  three  types  of 
contents).  The  comparison  parameter  also  correlated  significantly  and  substan¬ 
tially  with  psychometrically  measured  reasoning,  with  correlations  of  -.61,  -.66, 
and  -.67  for  analogies,  series  completions,  and  classifications  (collapsed  over 
contents)  respectively.  Each  of  encoding  and  justification  show  mixed  patterns 
of  correlations  (some  statistically  significant,  some  not).  The  response  parame¬ 
ter  aid  not  correlate  significantly  with  reasoning  in  any  case:  These  correla¬ 
tions  were  -.39,  -.01,  and  -.25  for  analogies,  series  completions,  and  classifi¬ 
cations,  respectively.  No  parameters  correlated  significantly  with  the  psycho¬ 
metric  perceptual-motor  speed  scores. 

These  patterns  of  correlations  were  theoretically  important  for  two 
reasons.  First,  they  showed  the  desired  pattern  of  convergent-discriminant 
validation  with  psychometrically-measured  reasoning  and  perceptual-motor  speed: 
Substantial  correlations  were  obtained  with  the  former  but  not  the  latter. 

Second,  and  even  more  importantly  in  view  of  the  results  of  the  earlier  analogy 
experiments,  convergent  and  discriminant  validation  were  demonstrated  in  the 
particular  pattern  of  correlation  between  the  task  parameters  and  psychometri- 
eally-aeasured  reasoning:  The  reasoning  parameters  showed  substantial  correla¬ 
tions  with  psychometrically-measured  reasoning,  but  the  response  parameter  did 
not.  The  use  of  highly  reliable  parameters  and  psychometric  test  scores,  and 
the  use  of  a  wide  range  of  item  difficulties,  seemed  to  have  paid  off. 

The  above  analyses  concentrated  upon  the  use  of  response  time  as  a 
dependent  measure,  and  assumed  individuals  represented  information  mentally 
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in  an  attribute-value  format.  In  a  separate  experiment,  Mike  Gardner  and  I 
constructed  mammal-name  analogies,  series  completions,  and  classifications, 
and  used  Henley's  (1969)  multidimensional  scaling  of  the  space  of  mammal  names 
as  a  representational  basis  for  predicting  subjects'  response  choices  in  rank- 
ordering  the  goodness  of  each  of  four  alternative  answer  options  as  completions 
to  each  item.  In  this  scaling,  mammal  names  are  embedded  in  a  three-dimensional 
space  having  as  its  dimensions  size,  ferocity,  and  humanness.  A  gorilla,  for 
example,  would  be  fairly  high  on  all  three  dimensions,  whereas  a  mouse  would 
be  fairl.’  low.  A  giraffe  would  be  high  on  size  but  low  on  ferocity  and  human¬ 
ness.  Examples  of  the  items  we  used  are  shown  in  Table  1. 


Insert  Table  1  about  here 


Thus,  we  were  interested  here  in  predicting  response  choices  rather  than 
response  times,  and  we  assumed  a  multidimensional  spatial  mental  representation 
rather  than  an  attribute-value  one.  The  theoretical  basis  for  this  work 
derived  from  the  work  of  Rumelhart  and  Abrahamson  (1973)  on  a  theory  of  response 
choice  in  analogical  reasoning.  We  extended  this  theory  to  series  completions 
and  classifications.  The  theory,  which  postulated  an  exponential  decay  function 
relating  ranking  of  goodness  of  response  choices  to  distance  of  these  response 
choices  from  the  ideal  response  in  the  three-dimensional  semantic  space,  account¬ 
ed  for  .97,  .98,  and  .99  of  the  variance  in  the  analogies,  series  completion, 
and  classification  data  respectively.  Thus,  it  apoears  that  the  kind  of 
componential  breakdown  I  have  proposed  can  be  applied  to  response  choice  as 
well  as  response  time,  and  to  a  multidimensional  spatial  as  well  as  an  attribute- 
value  representation. 


i 


Table  1 
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Sample  Animal  Names  Items 


Analogy 


Tiger  :  Chimpanzee  : :  Wolf  : 

(I)  Raccoon  (2)  Camel  (3)  Monkey  (4)  Leopard 

[The  subject's  task  is  to  rank-order  the  options  in  terns  of  their  goodness 
as  completions  to  the  analogy.] 


Series  Completion 


Squirrel  :  Chipmunk  :: 

(1)  Raccoon  (2)  Horse  (3)  Dog  (4)  Camel 

[The  subject's  task  is  to  rank-order  the  options  in  terms  of  their  goodness 
as  completions  to  the  series.] 


Classification 


Zebra  Giraffe  Goat 
(1)  Dog  (2)  Cow  (3)  Mouse 


(4)  Deer 


[The  subject’s  task  is  to  rank-order  the  options  in  terras  of  their  goodness 
as  members  of  the  class  formed  by  the  three  terms  in  the  item  stem.) 
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Extension  of  theory  to  metaphors.  In  comprehending  and  appreciating  a 
metaphor,  we  conceive  of  something  new  in  terms  of  something  old.  In  the 
metaphor,  "Man  is  a  wolf,"  for  example,  the  "new"  term,  or  tenor  of  the  meta¬ 
phor,  man,  is  seen  in  terms  of  the  old  term,  or  vehicle  of  the  metaphor,  wolf . 

The  basis  for  the  comparison  between  man  and  wolf,  or  ground  of  the  metaphor, 
is  left  Implicit. 

Because  the  conception  of  something  new  in  terms  of  something  old  forms 
the  basis  for  analogical  thinking  as  well  as  for  metaphorical  thinking,  and 
because  analogical  thinking  has  generally  been  thought  to  comprise  a  broader 
range  of  mental  phenomena  than  has  metaphorical  thinking,  some  students  of 
metaphor  have  been  inclined  to  view  metaphorical  understanding  as  a  form  of 
analogical  thinking  (e.g.,  Aristotle,  1927;  Billow,  1975;  Gentner,  1977;  Miller, 
1979;  Sapir,  1977;  Sternberg,  Tourangeau,  &  Nigro,  1979;  Tourangeau  &  Sternberg, 
1981,  1982  ;  Sternberg  &  Nigro,  in  press.)  On  this  view,  the  metaphor  "Man 

is  a  wolf"  can  be  viewed  as  an  implicit  analogy  in  which  some  properties  of 
man  are  seen  as  analogous  to  some  properties  of  a  wolf;  the  metaphor,  "The  lion 
is  the  king  of  beasts,"  can  be  understood  as  the  incomplete  analogy,  "lion  : 
beasts  ::  king  :  ?"  (Miller,  1979). 

Some  theorists  have  proposed  that  to  view  metaphors  as  nothing  more  than 
abridged  analogies  is  to  miss  the  essence  of  metaphors.  They  propose  that  in 
metaphors  there  is  an  interaction  between  tenor  and  vehicle  so  that  the  resulting 
meaning  of  the  metaphor  involves  a  blending  of  the  two  terms  (Black,  1962; 
Richards,  1936).  Some  recent  studies  by  Malgady  and  Johnson  (1976),  Verbrugge 
and  McCarrell  (1977), and  others,  have  supported  this  view.  Thus,  one  might 
theorize  that  metaphors  are  often  built  upon  a  foundation  of  analogy,  but  that 
they  involve  an  Interaction  between  terms  that  is  either  minimally  present  or 
entirely  absent  in  the  base  analogies. 

My  students  and  I  have  proposed  that  the  above  view — of  metaphors  as  ana- 
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logically  based  with  the  addition  of  some  kind  or  kinds  of  elements  of  interac¬ 
tion — can  give  a  good  account  of  metaphorical  comprehension  and  appreciation. 
Consider  an  analogy  presented  earlier  as  re-expressed  in  multiple-choice 
format:  "lion  :  beasts  ::  king  :  (a)  rulers,  (b)  humans."  When  the  metaphor 
is  expressed  in  this  form,  the  theory  of  analogical  reasoning  described  earlier 
can  be  directly  applied  to  the  understanding  of  the  metaphor.  The  subject 
must  encode  the  given  terms,  infer  the  relation  of  lion  to  beasts,  map  the  higher- 
order  relation  that  links  a  lion  to  a  king,  apply  the  previously  inferred  relation 
as  mapped  to  the  new  domain  to  generate  an  ideal  answer,  compare  this  answer  to 
each  of  the  alternatives,  justify  one  of  the  given  answers  as  better  than  the 
other,  although  possibly  nonideal,  and  respond .  The  theorized  identity  of 
performance  components  does  not  imply  equivalence  in  the  difficulty  of  the  meta¬ 
phor  and  its  corresponding  analogy.  On  the  one  hand,  the  additional  verbal  material 
contained  in  the  metaphor  increases  the  reading  load  of  this  presentation  format; 
on  the  other  hand,  this  additional  mediating  context  may  make  the  metaphor  more 
readily  comprehensible.  Hence,  the  relative  difficulties  of  the  two  presen¬ 
tation  formats  will  depend  upon  the  relative  effects  of  increased  reading  load 
and  increased  mediating  context. 

Proportional  metaphors  are  often  presented  in  ways  that  leave  at  least  some 
of  the  terms  of  the  underlying  analogy  implicit.  The  "lion  and  king"  metaphor, 
for  example,  could  be  presented  in  any  of  the  following  formats  (among  others), 
where  either  no  terms  or  some  terms  are  left  implicit: 

1.  A  lion  among  beasts  is  a  kind  among  people. 

2.  A  lion  among  beasts  is  a  king. 

3.  A  lion  is  a  king  among  people. 

4.  A  lion  is  a  king. 

5.  A  lion  is  a  king  among  beasts. 
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An  important  thing  to  notice  in  these  various  metaphorical  forms  is  that 
different  terms  are  left  implicit  in  different  forms  of  presentation.  These 
different  forms  may  differ  in  their  comprehensibility,  as  well  as  in  their 
aptness,  as  a  function  of  the  terms  that  are  left  implicit,  and,  in  the  fifth 
form,  as  a  function  of  the  reordering  of  terms:  "Beasts,"  the  second  term  of 
the  implicit  analogy,  is  presented  last.  On  the  present  theory,  the  reason  for 
these  variations  in  comprehensibility  and  information-processing  difficulty 
would  be  found  in  the  fact  that  these  forms  require  not  only  comprehension  of 
the  explicit  terms  and  of  the  relations  that  can  be  formed  between  these  terms, 
but  also  the  generation  of  terms  that  are  left  implicit,  and  the  comprehension 
of  relations  between  these  pairs  of  terms  (as  well  as  between  implicit  and 
explicit  ones).  Miller  (1979)  seems  to  share  a  similar  view. 

The  views  presented  above  led  Georgia  Nigro  and  I  to  several  predictions 
about  metaphorical  information  processing  (Sternberg  &  Nigro,  in  press}.  First, 
the  information-processing  components  used  in  the  understanding  of  metaphors 
and  especially  metaphors  with  relatively  fewer  implicit  terms  should  be  highly 
overlapping  with  the  components  used  in  the  understanding  of  analogies.  Second, 
metaphors  should  become  more  comprehensible  and  be  viewed  as  more  apt  as  the 
number  of  terms  of  the  iderlying  analogy  that  are  made  explicit  is  increased, 
thereby  clarifying  the  meaning  of  the  metaphor.  Third,  metaphors  should  become 
more  comprehensible  and  be  viewed  as  more  apt  as  tenor-vehicle  interaction  is 
made  more  clear  and  vivid  by  the  language  in  which  t'e  metaphor  is  presented. 
These  hypotheses  were  tested  in  two  experiments. 

The  first  experiment  investigated  the  first  hypothesis.  Base  statements 
were  presented  in  either  metaphorical  or  analogical  form  with  two  forced- 
choice  options  for  completion  of  the  statements.  All  elements  in  the  metaphors 
from  the  underlying  analogy  were  made  explicit.  Thus,  a  subject  might  see 
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either  "Bees  in  a  hive  are  a  Roman  mob  in  the  (a)  Coliseum,  (b)  streets,"  or 
"Bees  :  Rive  ::  Roman  mob  :  (a)  Coliseum,  (b)  streets."  Subjects  were  asked 
to  complete  the  statements  as  quickly  and  as  accurately  as  possible.  Half 
of  the  96  subjects  saw  the  metaphorical  format  and  half  saw  the  analogical 
format.  Each  subject  was  presented  with  a  6et  of  50  different  test  items. 

Mean  response  latencies  were  3.84  and  3.90  seconds  for  the  metaphorical 
and  analogical  item  formats,  respectively.  The  difference  between  these 
latencies  was  nonsignificant.  Error  rates  were  .06  in  each  condition,  and 
these,  too,  obviously  did  not  differ  significantly.  These  mean  data  are 
thus  consistent  with  the  notion  that  similar  or  identical  processing  com¬ 
ponents  are  used  in  each  task.  The  correlation  between  latencies  (computed 
across  item  types)  was  .80;  that  between  error  rates  was  not  meaningful 
because  of  the  very  low  error  rates  on  individual  item  types.  The  correlation 
between  latencies  needs  to  be  considered  in  conjunction  with  the  internal- 
consistency  reliabilities  of  the  latency  data,  which  were  .90  for  the  meta¬ 
phors  and  .93  for  the  analogies.  The  comparison  between  the  task  intercorrela¬ 
tion  and  the  task  reliabilities  shows  that  although  processing  of  metaphors 
and  analogies  was  probably  highly  similar  in  nature,  it  was  not  identical, 
since  there  was  still  some  systematic  variance  left  unaccounted  for.  As 
mentioned  earlier,  at  least  some  difference  would  be  expected,  since  the  meta¬ 
phors  supplied  mediating  context  that  was  absent  in  the  analogies,  and  may  have 
increased  tenor-vehicle  interaction. 

The  data  were  mathematically  modeled  by  predicting  response  latencies  from 

the  independent  variables  specified  by  the  proposed  theory  of  analogical  and 

metaphorical  reasoning.  The  overall  fit  of  the  model  to  each  data  set  was  quite 

2 

good:  Squared  correlations  (R  )  between  predicted  and  observed  latencies  were 
.86  for  the  metaphors  and  .73  for  the  analogical  format.  Root-mean-square  devia¬ 
tions  (RMSDs)  of  observed  from  predicted  values  were  .30  and  .60  seconds  in  the 


36 


metaphorical  and  analogical  conditions,  respectively.  These  model  fits  are 

based  upon  only  the  four  strongest  regression  parameters  of  the  proposed 

model.  If  all  parameters  allowed  by  the  theory  are  entered  in,  the  values  of 
2 

R  increase  to  .87  for  metaphors  and  .83  for  analogies.  Clearly,  the  proposed 
model  does  a  reasonable  job  of  accounting  for  latency  data  reflecting  the 
comprehend  ,>r.  of  both  the  metaphors  and  the  analogies. 

A  second  experiment  investigated  the  second  and  third  hypotheses,  as  well 
as  providing  additional  confirming  evidence  for  the  first  hypothesis.  In  this 
experiment,  base  statements  were  presented  in  each  of  the  five  different 
metaphorical  formats  presented  earlier,  where  the  formats  differed  in  the  num¬ 
ber  and  identities  of  the  terms  of  the  underlying  analogy  that  were  left  implicit. 
Subjects  were  asked  to  rate  either  the  aptness  or  the  comprehensibility  of  each 
metaphorical  statement.  Half  of  the  48  subjects  rated  aptness,  the  other  ha*., 
comprehensibility,  for  each  of  the  50  metaphors  of  Experiment  1  re-presented  in 
the  five  formats  described  earlier  (e.g.,  (1)  Bees  in  a  hive  are  a  Roman  mob 
in  the  Coliseum;  (2)  Bees  in  a  hive  are  a  Roman  mob;  (3)  Bees  are  a  Roman  mob 
in  the  Coliseum;  (4)  Bees  are  a  Roman  mob;  and  (5)  Bees  are  a  Roman  mob  in  a  hive). 

For  aptness,  the  effect  of  metaphorical  form  was  highly  significant.  An 
examination  of  the  pattern  of  ratings  revealed  that  for  Forms  1-4, 
higher  ratings  were  attained  for  metaphorical  formats  in  which  fewer  terms  were 
left  Implicit,  as  predicted.  Thus,  when  terms  are  presented  in  the  natural 
A-B-C-D  order  corresponding  to  the  order  of  terms  in  the  implicit  analogy,  the 
presentation  of  more  terms  is  associated  with  higher  aptness.  But  Form  5,  where 
the  order  of  the  second  and  third  terms  was  reversed  relative  to  the  implicit 
underlying  analogy  ("Roman  mob,"  in  the  example,  precedes  rather  than  follows 
"hive"),  was  rated  as  most  apt.  The  high  Form  5  rating  did  not  merely  reflect 
its  intermediate  number  of  terms.  We  suggest  that  the  Form  5  metaphor  was  rated 
as  most  apt  because  the  ordering  of  terms  suggested  something  more  than  is  suggested 


37 


in  the  other  metaphorical  forms:  In  particular,  it  suggested  an  interaction  be¬ 
tween  tenor  and  vehicle  more  than  that  suggested  by  the  other  forms.  In  metaphors 
such  as  "A  pear  is  a  Buddha  on  a  sill,"  or  "Bees  are  a  Roman  mob  in  a  hive,"  or 
"Tombstones  are  teeth  in  a  graveyard,"  the  tenor  i6  more  easily  conceived  in 
terms  of  the  vehicle,  and  it  is  especially  easy  in  many  cases  to  create  an 
image  of  tenor-vehicle  interaction.  One  can  easily  imagine  a  Buddha  transplanted 
to  a  window  sill,  a  Roman  mob  scurrying  mindlessly  in  a  hive,  or  teeth  sticking 
up  from  the  ground  in  a  graveyard.  The  fifth  form  thus  provides  an  ordering 
of  terms  that  facilitates  one's  understanding  of  tenor-vehicle  interaction,  and 
thus  aptness  is  increased.  In  the  other  metaphorical  forms,  adherence  to  the 
order  of  terms  in  the  underlying  analogy  reduces  ease  of  perceived  interaction 
of  tenor  and  vehicle,  and  aptness  is  correspondingly  reduced.  In  order  to  test 
our  hypothesis  that  the  fifth  metaphorical  form  encouraged  formation  of  interac¬ 
tive  imagery  more  than  did  the  second  metaphorical  form  (which  contained  exactly 
the  6arae  terms  in  the  standard  analogical  order)  or  than  did  any  other  form, 
we  had  a  separate  group  of  20  subjects  rate  vividness  of  interactive  imagery  for 
each  metaphor  in  each  format.  Mean  ratings  were  highest  for  Form  5.  Most  criti¬ 
cally,  the  mean  for  Form  5  was  higher  than  for  Form  2,  which  differed  only  in 
order  but  not  in  content  of  terms.  The  same  patterns  of  results  obtained  for 
aptness  obtained  as  well  for  comprehensibility. 

We  also  mathematically  modeled  the  ratings  of  aptness  and  comprehensibility 

on  the  basis  of  the  independent  variables  in  our  theory.  For  the  full  model 

plus  the  effect  of  interactive  imagery  plus  the  effect  of  comprehensibility  on 

2 

aptness  or  the  effect  of  aptness  on  comprehensibility,  values  of  R  between  pre¬ 
dicted  and  observed  data  for  the  five  respective  metaphorical  formats  were  .71, 
.77,  .69,  .78,  and  .71  for  aptness,  and  .78,  .80,  .73,  .86,  and  .82  for  compre¬ 
hensibility.  These  model  fits  were  quite  impressive  in  view  of  the  fact  that 
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the  theory  was  originally  formulated  to  predict  response  latencies  rather  than 
ratings  of  aptness  or  comprehensibility. 

The  results  of  the  second  experiment  thus  confirmed  our  second  and  third 
hypotheses,  as  well  as  providing  additional  confirming  evidence  for  our  first 
hypothesis.  The  componential  theory  of  analogical  reasoning  6eems  to  lend  itself 
to  extension  to  metaphorical  comprehension,  as  well  as  to  metaphorical  aptness 
and  comprehensibility  as  measured  by  subjects'  ratings. 

The  research  described  above  emphasized  processing  of  metaphorical  informa¬ 
tion.  Roger  Tourangeau  and  I  have  concentrated  upon  the  representation  of  meta¬ 
phorical  information  in  a  separate  series  of  studies  (Tourangeau  &  Sternberg, 
1981,  1982}..  Imagine  an  array  of  "local  subspaces"  comprising  sets  of 

terms,  such  as  U.S.  historical  figures,  modern  world  leaders,  mammals,  birds, 
fish,  airplanes,  land  vehicles,  and  ships.  Each  local  subspace  represents  the 
terms  within  it  as  points  with  coordinates  on  each  of  several  dimensions.  Each 
of  these  local  subspaces  might  also  be  viewed  as  of  roughly  the  same  order 
(level  of  abstraction),  and  as  of  a  lower  order  than  a  higher-order  hyperspace 
that  contains  the  lower -order  subspaces  as  points  embedded  within  it.  Thus, 
the  points  of  the  higher-order  hyperspace  map  into  the  lower-order  subspaces, 
and  can  be  labeled  by  the  names  of  these  subspaces.  This  hyperspace  can,  in 
turn,  be  viewed  as  one  of  multiple  subspaces  of  some  still  higher-order  hyper¬ 
space,  although  such  very  high-order  hyperspaces  will  not  concern  us  here. 

We  shall  also  need  some  rule  for  restricting  the  subsp :s  that  map  into 
a  single  hyper space,  and  some  way  of  establishing  comparability  across  subspaces. 
Both  of  these  goals  can  be  accomplished  by  requiring  all  subspaces  to  have  at 
least  one  corresponding  dimension.  Thus,  for  example,  the  subspaces  of  modern 
world  leaders,  bird  names,  and  ships  must  have  at  least  one  corresponding  di¬ 
mension  if  they  are  to  be  local  subspaces  of  the  same  order  and  of  a  common 


hyper space. 
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In  a  semantic-differential  paradigm  (Osgood,  Suci,  &  Tannenbaum,  1957),  sub¬ 
jects  were  asked  to  rate  each  of  twenty  items  within  each  domain  on  twenty-one 
scales,  such  as  warlike-peaceful ,  noble-ignoble,  and  strong-weak,  with  a  different 
group  of  sixteen  subjects  supplying  ratings  for  each  of  the  eight  domains.  We 
hoped  in  this  way  to  obtain  a  corresponding  set  of  dimensions  for  the  eight  do¬ 
mains  named  above  (U.S.  historical  figures,  modern  wo.ld  leaders,  mammals,  birds, 
fish,  airplanes,  land  vehicles,  and  ships).  It  seemed  plausible  to  us  that  at 
least  two  such  corresponding  dimensions  would  obtain:  prestige  (similar  to  Os¬ 
good  et  al.'s  "evaluative"  dimension)  and  aggression  (similar  to  Osgood  et  al.'s 
"potency"  or  "activity"  dimensions) .  The  adjective  pairs  for  each  domain  were 
then  factor  analyzed. 

Visual  inspection  of  the  results  of  the  factor  analyses  supported  our 
hypothesis:  Two  corresponding  dimensions  of  prestige  and  aggression  appeared 
for  each  domain,  although  the  order  in  which  the  two  dimensions  appeared  was 
variable  across  domains.  In  order  to  confirm  our  visual  impression,  statisti¬ 
cal  analyses  were  performed  to  ascertain  degrees  of  dimensional  interrelatedness. 
Corresponding  dimensions  according  to  the  visual  analysis  were  found  to  be 
highly  statistically  related  as  well,  and  noncorresponding  dimensions  to  be 
only  poorly  statistically  related. 

This  representational  framework  was  used  as  a  basis  for  constructing  rules 
that  would  identify  metaphors  as  more  or  less  aesthetically  pleasing.  Two  dis¬ 
tances  in  the  proposed  spaces  were  deemed  relevant:  "superimposed  within-subspace 
distance"  between  the  tenor  (first  term)  of  the  metaphor  and  the  vehicle  (second 
term)  of  the  metaphor,  and  "between-subspace  distance"  between  these  two  terms. 

Consider  first  the  meaning  of  "superimposed  within-subspace  distance."  Since 
at  least  two  dimensions  are  corresponding  (or  at  least,  very  similar)  for  each 
domain,  one  can  imagine  superimposing  the  dimensions  of  one  local  subspace  onto 
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the  corresponding  dimensions  of  another  local  subspace.  Once  this  superimposition 
is  accomplished,  it  is  also  possible  to  imagine  computing  the  superimposed  within- 
subspace  distance  between  two  points  that  are  actually  in  different  subspaces. 

One  simply  computes  the  distance  between  points  as  though  they  were  in  the 
same  subspace.  Thus,  if  the  coordinates  of  some  point  in  one  subspace  were 
(x,  y) ,  then  the  superimposed  within-subspace  distance  to  some  point  in  another 
subspace  would  be  0  if  that  point  also  happened  to  occupy  location  (x,  y) ,  and 
would  depart  from  0  as  the  Euclidean  distance  of  that  point  from  (x,  y)  increased. 

An  example  may  help  clarify  the  concept.  The  superimposed  within-subspace 
distance  from  wildcat  to  hawk  is  very  small,  because  the  coordinates  of  hawk 
in  the  bird  subspace  are  very  close  to  those  of  wildcat  in  the  mammal  subspace. 

The  superimposed  within-subspace  distance  from  wildcat  to  robin  is  quite  large, 
however,  because  the  coordinates  of  wildcat  and  robin  are  quite  disparate. 
Similarly,  the  superimposed  within-subspace  distance  from  wildcat  to  ICBM  is 
small,  whereas  the  superimposed  within-subspace  distance  from  wildcat  to  blimp 
is  large.  The  distance  concept  is  further  illustrated  in  Figure  4. 

Insert  Figure  4  about  here 

Consider  next  the  meaning  of  "between-subspace"  distance.  In  order  for 
the  concept  to  have  meaning,  it  must  be  possible  somehow  to  compute  the  distance 
between  a  pair  of  subspaces.  This  computation  is  possible,  in  our  representational 
formulation,  because  the  distance  between  two  subspaces  is  equal  to  the  distance 
between  the  corresponding  points  within  the  appropriate  hyper space.  Thus,  if  the 
coordinates  of  some  local  subspace  in  the  hyperspace  are  (x,  y) ,  the  distance 
from  that  subspace  to  another  subspace  increases  as  the  Euclidean  distance  of 
that  subspace  from  (x,  y)  increases. 

Let  us  return  to  our  earlier  example  to  illustrate  the  concept  of  between- 
subspace  distance.  The  between-subspace  distance  from  wildcat  to  hawk  is  the  same 
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as  that  from  wildcat  to  robin,  since  both  hawk  and  robin  are  in  the  same  local 
subspace.  This  distance  is  small,  since  mammal  and  bird  names  are  viewed  as 
relatively  close  to  one  another  in  the  hyperspace.  The  between-subspace  distances 
from  wildcat  to  ICBM  and  blimp  are  also  the  same,  since  these  latter  two  terms 
fall  within  the  same  local  subspace;  and  this  distance  is  relatively  large,  since 
mammal  names  and  names  of  airplanes  are  viewed  as  relatively  far  from  one  another 
in  the  hyperspace.  The  distance  concept  is  illustrated  further  in  Figure  4. 

Turning  now  to  the  theory  of  metaphorical  aptness,  we  proposed  that  a 
metaphor  is  aesthetically  pleasing ,  or  apt ,  to  the  extent  that  the  superimposed 
with in-subspace  distance  is  small,  but  the  between-subspace  distance  is  large . 
Consider  some  examples  of  metaphors  derived  from  the  terms  discussed  above: 

(1)  A  wildcat  is  a  hawk  among  mammals;  (2)  A  wildcat  is  a  robin  among  mammals; 

(3)  A  wildcat  is  an  ICBM  among  mammals;  and  (4)  A  wildcat  is  a  blimp  among  mammals. 
What  empirical  claims  does  the  proposed  theory  make  about  each  of  these  metaphors? 
According  to  the  theory,  (3)  should  be  the  metaphor  of  highest  quality,  since 
although  wildcat  and  ICB?1  are  quite  close  to  one  another  in  terms  of  superimposed 
within-subspace  distance,  they  are  from  distant  local  subspaces.  Metaphor  (2) 
should  be  lowest  in  quality,  because  the  tenor  and  vehicle  occupy  discrepant 
positions  in  their  respective  subspaces,  and  are  from  proximal  subspaces.  Metaphor 
(1)  and  (4)  should  be  intermediate  in  quality.  Since  we  expect  superimposed  within 
subspace  distance  to  carry  more  weight  than  between-subspace  distance,  we  would 
predict  that  metaphor  (1)  would  be  perceived  as  more  apt  than  metaphor  (4).  Thus, 
the  ordering  of  metaphors  in  terms  of  aptness  is,  from  greatest  to  least,  (3),  (1), 

(4) ,  (2). 

The  predictions  of  our  "dual-distance"  theory  were  tested  in  two  experiments. 
In  Experiment  1,  37  subjects  were  asked  to  rate  the  aptness  of  metaphors  such 
as  "The  owl  is  the  horse  among  birds."  Tenors  and  vehicles  were  taken  from  the 
local  aubspaces  described  earlier.  According  to  our  theory,  metaphorical  aptness 
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should  be  negatively  correlated  with  the  superimposed  vithin-subspace  distance 
between  tenor  and  vehicle,  and  positively  correlated  with  the  between-subspace 
distance.  Both  predictions  were  confirmed.  Correlations  were  -.39  for  super¬ 
imposed  within-subspace  distance,  and  .27  for  between-subspace  distance  (both 
statistically  significant,  if  small).  When  multiple  regression  was  used  to 
predict  aptness  from  these  two  distances  plus  comprehensibility  of  the  metaphor, 
the  multiple  correlation  was  .76.  A  fact  of  incidental  interest  is  that  the 
simple  correlation  of  overall  distance  with  metaphorical  goodness  was  just  -.01. 
This  trivial  correlation  suggests  why  many  previous  investigations  that  have 
failed  to  make  our  distinction  between  the  two  kinds  of  distances  have  been 
inconclusive  and  theoretically  inadequate. 

In  Experiment  2,  20  subjects  were  asked  to  rank-order  the  goodness  of 
metaphorical  completions  in  terms  with  the  format  exemplified  by  "A  crab  is 

a _ among  sea  creatures.  (1)  tiger,  (2)  mongoose,  (3)  rat,  (4)  horse." 

Half  the  items  had  options  chosen  from  a  single  local  subspace,  as  in  the  ex¬ 
ample,  and  half  had  options  chosen  from  multiple  local  subspaces,  for  example, 

"A  blue  whale  is  a  _  among  sea  creatures.  (1)  killer  whale,  (2)  Giscard 

d’Estaing,  (3)  satellite,  (4)  lion."  The  rank-order  correlation  between  super¬ 
imposed  within-subspace  distance  an! option  popularity  (aptness)  was  -.46  for 
metaphors  of  the  first  kind  (options  from  a  single  local  subspace)  and  -.48 
for  metaphors  of  the  second  kind  (options  'rcm  multiple  local  subspaces). 

The  correlation  with  between-subspace  distance,  which  could  be  computed  only 
for  metaphors  of  the  second  kind,  was  a  nonsignificant  .06.  An  exponential 
model  was  also  fit  to  the  response-choice  data,  with  choice  proportions  predicted 
on  the  basis  of  the  two  kinds  of  distances.  The  model  was  successful  for  the 
options  of  the  first  kind  (correlation  between  predicted  and  observed  values 
equal  to  .98)  but  not  for  the  options  of  the  second  kind,  where  between-  as  well 
as  superimposed  within-subspace  distance  was  manipulated. 
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Although  the  theory  was  applied  only  to  items  falling  within  semantic  fields, 
its  general  principles  can  be  applied  to  items  outside  such  fields.  For  example, 
the  theory  predicts  that  Donne's  famous  conceit  relating  lovers  to  stiff  twin 
compasses  will  be  apt  because  the  superimposed  withln-subspace  distance,  as  6hown 
by  Donne,  is  low  (i.e.,  lovers  and  stiff  twin  compasses  can  be  shown  to  bear 
many  similarities),  but  the  between-subspace  distance  is  high  (i.e.,  lovers  and 
stiff  twin  compasses  are  from  very  distant  domains).  At  the  opposite  extreme, 
literal  statements  make  for  poor  metaphors  because  their  between-subspace  dis¬ 
tance  is  zero,  regardless  of  what  their  within-subspace  distance  may  be.  For 
example,  "An  1CBM  is  an  intercontinental  missile"  has  zero  superimposed  within- 
subspace  distance  (which  is  good  for  metaphorical  aptness)  ,  but  zero  between- 
subspace  distance  as  well  (which  is  bad  for  metaphorical  aptness) .  Anomalous 
statements  such  as  "An  ICBM  is  a  haystack"  make  for  poor  metaphors  because  what¬ 
ever  may  be  their  between-subspace  distance,  their  superimposed  within-subspace 
distance  will  generally  be  very  high. 

To  conclude,  we  have  proposed  theories  of  metaphorical  information  processing 
and  representation  that  seem  to  capture  major  aspects  of  metaphorical  comprehension 
and  appreciation.  The  theory  suggests  that  metaphorical  processing  can  be  related 
to  other  kinds  of  inductive  information  processing  in  terms  of  the  components  used 
in  metaphorical  understanding,  but  that  an  additional  element,  that  of  interaction, 
applies  uniquely  to  the  metaphorical  (or  figurative)  format  of  verbal  presentation. 
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Theory  of  causal  inference.  Civilian  and  military  personnel  alike  frequently 
need  to  infer  the  causal  antecedent  or  antecedents  of  significant  real-world  events. 
For  example,  the  response  of  the  United  States  government  to  the  recent  Soviet 
invasion  of  Afghanistan  was  predicated  upon  the  motives  U.S.  government  officials 
inferred  to  underlie  the  Soviet  attack.  The  response  would  almost  certainly  have 
been  different  if  government  policy-makers  had  believed  Soviet  claims  that  the 
Soviets  were  merely  responding  to  a  request  from  the  Afghan  government  for  help 
in  resolving  domestic  turmoil. 

We  have  proposed  that  people  use  four  basic  kinds  of  evidence  in  evaluating 
the  likelihood  that  a  given  event  is  a  causal  antecedent  for  a  given  consequent 
(Schustack  &  Sternberg,  1981;  Sternberg  &  Schustack,  1981); 

1 .  Conf innation  by  joint  presence  of  possibly  causal  event  and  outcome. 

The  potential  cause  and  the  given  outcome  tend  to  occur  in  conjunction.  For 
example,  because  widespread  increases  in  wages  tend  to  be  followed  bv  widespread 
increases  in  prices,  we  tend  to  attribute  the  price  increases  at  least  in  part 
to  the  increases  in  wages  (the  well-known  "inflationary  spiral").  This  relation 
between  the  possibly  causal  event  and  the  outcome  event  is  evidence  in  favor  of 
the  sufficiency  of  the  possibly  causal  event  for  the  outcome,  i.e.,  if  the  pos¬ 
sibly  causal  event  occurs,  so  does  the  outcome. 

2.  Confirmation  by  joint  absence  of  possibly  causal  event  and  outcome.  The 
absence  of  the  potential  cause  tends  to  be  associated  with  the  absence  of  the 
given  outcome.  For  example,  countries  that  are  disarmed  (or  almost  disarmed) 
tend  not  to  start  wars,  so  tnat  one  might  reasonably  conclude  that  the  starting 

of  wars  is  at  least  in  part  attributable  to  the  presence  of  armaments  in  a  country's 
arsenal.  This  relation  between  the  possibly  causal  event  and  the  outcome  event  is 
evidence  in  favor  of  the  necessity  of  the  possibly  causal  event  for  the  outcome, 
i.e.,  the  outcome  event  (here,  starting  wars)  tends  to  occur  only  if  the  antecedent 
event  is  present  (here,  armaments  in  the  country's  arsenal). 
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3.  Disconfirmation  by  presence  of  possibly  causal  event  but  absence  of  outcome . 
The  presence  of  the  potential  cause  tends  to  be  associated  with  the  absence  of  the 
given  outcome.  For  example,  the  presence  of  large  numbers  of  members  of  a  given 
ethnic  group  in  a  country  originally  foreign  to  them  does  not  (usually!)  lead  to 
overt  action  on  the  part  of  these  ethnics  to  take  over  the  country  by  force. 

This  relation  between  the  possibly  causal  event  and  the  outcome  event  is  evidence 
against  the  sufficiency  of  the  possibly  causal  event  for  the  outcome,  that  is, 
the  occurrence  of  the  possibly  causal  event  does  not  always  lead  to  the  oc¬ 
currence  of  the  outcome.  Thus,  in  our  country,  there  are  large  ethnic  populations, 
but  such  populations  have  not  been  associated  with  attempts  to  overthrow  the 
government . 

4 .  Disconfirmation  by  absence  of  possibly  causal  event  but  presence  of  outcome. 
The  absence  of  the  potential  cause  tends  to  be  associated  with  the  presence  of  the 
given  outcome.  For  example,  a  suspect’s  having  been  a  hundred  miles  away  from  the 
scene  of  a  murder  at  the  time  the  murder  occurred  would  tend  to  disconfirm  the 
inference  that  the  suspect  committed  the  crime.  This  relation  is  evidence 
against  the  necessity  of  the  possibly  causal  event  for  the  outcome,  i.e.,  the 
occurrence  of  the  outcome  (the  crime  in  this  example)  was  not  preceded  by  the  oc¬ 
currence  of  the  possibly  causal  event  (the  suspect's  having  been  present). 

A  fifth  kind  of  information  can  also  be  relevant  to  causal  inferences,  namely, 
base  rate.  Base  rate  refers  to  the  probability  of  a  given  outcome  occurring  in 
the  absence  of  any  new  information  regarding  the  probability  of  occurrence  in 
the  particular  situation.  One  simply  uses  one's  world  knowledge  about  cooccurrences 
of  events.  We  theorized,  in  concordance  with  past  literature  (e.g.,  Nisbett  & 

Ross,  1980),  that  base-rate  information  is  used  only  minimally  in  causal  inference. 

It  is  important  to  note  that  the  kinds  of  information  considered  above  can 
be  used  both  to  te6t  one's  preferred  hypothesis,  and  to  test  alternative  hypotheses 
as  well.  Thus,  the  various  kinds  of  evidence  can  be  combined  to  give  a  causal 
likelihood  for  each  of  a  set  of  potential  causal  antecedents. 


In  our  research  on  causal  inference,  subjects  had  to  evaluate  the  likelihood 
that  a  given  outcome  event  was  the  result  of  a  particular  hypothesized  causal  event. 
Individuals  had  to  make  these  judgments  with  incomplete  information  about  complex 
problems  varying  simultaneously  on  many  dimensions.  In  making  the  judgments, 
subjects  had  to  decide  what  kinds  of  evidence  to  consider,  how  to  weight  each 
kind  of  evidence,  how  to  combine  various  kinds  of  evidence,  and  how  to  translate 
their  conclusions  into  a  probability  that  the  target  hypothesis  was  responsible 
for  the  target  outcome. 

The  problems  used  in  our  research  resembled  in  many  respects  problems 
encountered  in  real-world  settings  that  require  causal  inferences.  We  used 
three  basic  kinds  of  problems  that  differed  in  the  content  domain  in  which 
the  causal  inference  was  to  be  made:  (a)  an  epidemiological  doir^in,  in  which 
the  individual  had  to  judge  the  likelihood  that  a  particular  hazard  was  responsi¬ 
ble  for  a  given  epidemic;  (b)  a  securities  domain,  in  which  the  individual  had 
to  judge  the  likelihood  that  a  particular  circumstance  was  responsible  for 
a  precipitous  decline  in  the  value  of  a  company  stock;  and  (c)  an  abstract  domain, 
in  which  the  individual  had  to  judge  the  likelihood  that  a  particular  circumstance 
(labeled  only  by  a  letter)  was  responsible  for  some  other  particular  circumstance 
(also  labeled  only  by  a  letter).  Consider  a  sample  problem  from  the  securities  domain 

A  market  analyst  noted  that,  among  pharmaceutical  manufacturers: 

In  Company  1, 

The  office  staff  of  the  company  organized  and  joined  a  union. 

The  company's  major  product  was  under  suspicion  as  a  carcinogen. 

There  vaB  a  drastic  drop  in  the  value  of  the  company's  stock. 

In  Company  2, 

The  office  staff  of  the  company  did  not  organize  or  join  a  union. 

The  company's  major  product  was  under  suspicion  as  a  carcinogen. 

There  was  a  drastic  drop  in  the  value  of  the  company '6  stock. 
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In  Company  3, 

Illegal  campaign  contributions  were  traced  to  the  company's  managers. 

The  company's  major  product  vas  not  under  suspicion  as  a  carcinogen. 

There  was  not  a  drastic  drop  in  the  value  of  the  company's  stock. 

WHAT  IS  THE  PROBABILITY  THAT,  FOR  SOME  OTHER  PHARMACEUTICAL  MANUFACTURER,  STOCK 
VALUES  WOULD  DROP  DRASTICALLY  IF  THE  COMPANY'S  MAJOR  PRODUCT  WERE  UNDER  SUSPICION 
AS  A  CARCINOGEN? 

In  each  problem  of  each  kind,  individuals  were  presented  with  the  hypothesis 
that  a  particular  event  was  responsible  for  some  outcome.  They  were  asked  to 
use  a  given  body  of  evidence  to  estimate  the  probability  (placed  on  a  0-100  scale 
to  eliminate  decimal  points)  with  which  that  event,  by  itself,  would  produce  that 
outcome.  Individuals  were  explicitly  warned  that  they  were  being  given  incomplete 
information  and  that  interactions  between  possibly  causal  events  were  possible; 
these  warning  were  aimed  at  evoking  the  same  kind  of  mental  set  as  would  real- 
life  causal  inference. 

Within  any  situation  in  any  kind  of  problem,  each  possible  cause  was  in  one 
of  three  states:  observed  to  be  present  (e.g.,  "The  office  staff  of  the  company 
organized  and  joined  a  union"),  observed  to  be  absent  (e.g.,  "The  office  staff  of 
the  company  did  not  organize  or  join  a  union"),  or  not  observed  (e.g.,  nothing  is 
stated  about  unionization).  Over  the  set  of  problems  in  three  content  domains, 
there  were  from  two  to  five  situations  (cities,  companies,  or  lines  of  problems) 
described  in  a  single  problem,  and  also  from  two  to  five  possibly  causal  events  ob¬ 
served  per  situation  independently  of  the  number  of  situations.  Within  a  single 
problem,  each  situation  had  the  same  number  of  observed  possibly-causal  events. 

Sixty-two  subjects  supplied  probability  ratings  for  each  of  the  two  concrete 
content  domains,  with  order  of  domain  counterbalanced.  Forty  subjects  supplied 
probability  ratings  for  the  abstract  content  domain  only.  An  additional  21  subjects 
supplied  base-rate  ratings  for  the  concrete-content  domain,  these  subjects  were 
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asked  the  question  without  being  given  the  prior  information  about  the  companies 
(epidemics,  or  abstract  letters).  Other  experiments  used  similar  designs  (see 
Schustack  &  Sternberg,  1981,  for  a  full  description  of  the  complete  set  of 
experiments,  only  one  of  which  is  described  here),  with  similar  results. 

Mean  response  probabilities  were  .35  for  epidemics,  .37  for  securities, 
and  .35  for  abstract  items  (with  decimal  points  now  inserted  for  ease  of 

comprehension) .  The  means  did  not  differ  significantly  from  each  other. 

2 

Values  of  R  between  predicted  and  observed  probabilities  were  .90,  .88,  and 
.90  for  epidemics,  securities,  and  abstract  content  respectively.  Deviations 
from  the  model  were  statistically  significant,  despite  the  high  levels  of 
fit.  RMSD's  of  observed  from  predicted  values  were  .07,  .06,  and  .06  (with 
decimal  points  again  inserted  by  us)  for  the  three  respective  content  types. 

The  proposed  theory  of  causal  inference  was  tested  against  a  number  of  plausi¬ 
ble  alternative  theories,  and  was  found  to  provide  a  superior  account  of  the 
data.  These  alternatives  included  both  linear  and  nonlinear  models. 

It  is  worth  noting  that  parameter  estimates  for  the  four  kinds  of  evidence 
considered  by  the  theory  were  highly  similar  across  the  three  content  domains. 
Consistent  with  past  evidence  in  research  on  causal  inference  (e.g.,  Wason,  1960) 
people  weighted  positive  confirming  evidence  the  most  highly.  Also  consistent 
with  past  evidence,  base  rate  information  was  hardly  used  at  all.  Evidence  about 
hypotheses  alternative  to  the  one  being  considered  was  evaluated,  but  was  assigned 
much  less  weight  than  would  have  been  optimal.  On  the  whole,  we  found  that  people 
used  a  wide  variety  of  evidence  types  in  making  their  judgments,  but  the  weights 
they  assigned  these  types  of  information  were  rather  far  from  those  that  would  be 
assigned  if  all  types  got  equal  weight. There  were  relatively  small  individual 
differences  in  these  weights,  but  weight  values  for  individual  subjects  were  not 
correlated  with  scores  on  standard  IQ  tests,  nor  is  there  any  a  priori  reason  to 
expect  they  ought  to  be:  We  are  dealing  with  problems  in  which  no  one  set  of  weights 
would  most  clearly  be  associated  with  high  Intelligence. 


Theory  of  projection  in  £  nonentrenched  reasoning  task.  During  the  past 
several  years,  I  have  been  pursuing  the  notion  that  intelligence  is  in  large 
part  the  ability  to  acquire  and  reason  with  new  conceptual  systems.  It  is  not 
merely  the  ability  to  learn  and  reason  with  new  concepts  but  the  ability  to 
learn  and  reason  with  new  kinds  of  concepts.  Intelligence  Is  not  so  much  a 
person's  ability  to  learn  or  think  within  conceptual  systems  that  the  person 
has  already  become  familiar  with  as  it  is  his  or  her  ability  to  learn  and 
think  within  new  conceptual  systems,  which  can  then  be  brought  to  bear  upon 
existing  knowledge  structures.  Thus,  an  intelligent  person  must  first  learn 
a  new  conceptual  system  and  then  see  how  it  applies  (analogically)  to  old  problems. 

I  have  referred  to  tasks  requiring  novel  kinds  of  thinking  in  novel  domains  as 
"nonentrenched . " 

The  main  nonentrenched  task  I  have  studied  is  one  that  requires  the  indi¬ 
vidual  to  make  a  projection  that  characterizes  the  state  of  an  object  at  some  fu¬ 
ture  time  on  the  basis  of  incomplete  information  about  the  state  of  the  object 
both  at  that  time  and  at  some  earlier  time.  The  projection  task  was  studied 
with  three  different  "surface"  structures  having  very  similar  "deep"  structures. 
Consider  the  first  instantiation  of  the  task,  which  requires  projection  of  the 
color  an  object  will  appear  to  be  at  a  future  time. 

In  the  first  instantiation  of  the  task,  subjects  were  presented  with  a  descrip¬ 
tion  of  the  color  of  an  object  in  the  present  day  and  in  the  year  2000.  The  de¬ 
scription  could  be  either  physical--a  green  dot  or  a  blue  dot— or  verbal — one  of 
four  color  words,  namely,  green,  blue,  grue,  and  blcen.  An  object  was  defined  as 
green  if  it  appeared  physically  green  both  in  the  present  and  in  the  year  2000. 

An  object  was  defined  as  blue  if  it  appeared  physically  blue  both  in  the  present 
and  in  the  year  2000.  An  object  was  defined  as  grue  if  it  appeared  physically  green 
in  the  present  but  physically  blue  in  the  year  2000  (i.e.,  it  appeared  physically 
green  until  the  year  2000  and  physically  blue  thereafter).  An  object  was  defined 
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as  bleen  if  it  appeared  physically  blue  in  the  present  but  physically  green  in 
the  year  2000  (i  e.,  j.t  appeared  physically  blue  until  the  year  2000  and  physi¬ 
cally  green  thereafter).  (The  terminology  is  bared  upon  Goodman,  1955.) 

Since  each  of  the  two  descriptions  (one  in  the  present  and  one  in  the  year 
2000)  could  take  one  of  either  two  physical  forms  or  four  verbal  forms,  there  were 
36  (6  x  6)  different  item  types.  The  subject’s  task  was  to  describe  the  object 
in  the  year  2000.  If  the  given  description  for  the  year  2000  was  a  physical 
one,  the  subject  had  to  indicate  the  correct  verbal  description  of  the  object; 
if  the  given  description  for  the  year  2000  was  a  verbal  one,  the  subject  had  to 
indicate  the  correct  physical  description  of  the  object.  There  were  always  three 
answer  choices  from  which  the  subject  had  to  choose  the  correct  one. 

Subjects  were  alerted  to  a  complexity  in  the  projection  task  that  applies 
to  the  real  world  as  well.  When  one  observes  the  physical  appearance  of  an  ob¬ 
ject  in  the  present  day,  one  can  be  certain  of  its  current  physical  appearance 
but  not  of  what  its  physical  appearance  will  be  in  the  year  2000.  Hence,  all 
descriptions  presented  for  the  present  day  could  be  guaranteed  to  be  accurate 
with  respect  to  physical  appearance  in  the  present,  but  they  could  not  be  guar¬ 
anteed  to  be  accurate  with  respect  to  their  implications,  if  any,  regarding  physi¬ 
cal  appearance  in  the  future.  For  physical  descriptions  of  objects  as  they  appear 
in  the  present,  this  complexity  presents  no  problems,  since  the  physical  descrip¬ 
tion  of  an  object  (a  green  dot  or  a  blue  dot)  carries  no  implications  regarding 
the  future  physical  appearance  of  the  object.  For  verbal  descriptions  of  objects 
as  they  appear  in  the  present,  however,  this  complexity  does  present  a  problem. 

The  verbal  descriptions  green  and  blue  imply  constancy  in  physical  appearance, 
whereas  the  verbal  descriptions  grue  and  bleen  imply  change.  Unfortunately,  all 
one  can  infer  with  certainty  from  these  verbal  descriptions  is  the  current  physical 
appearance  of  the  object.  The  implication  for  the  future  physical  appearance 
of  the  object  can  only  be  a  guess,  which  may  be  right  or  wrong.  This  complexity 


ceases  to  exist  for  the  observer  in  the  year  2000  because  at  this  point  all  of 
the  evidence  is  in.  The  observer  in  the  year  2000  knows  for  certain  what  the 
physical  appearance  of  the  object  is  in  2000  and  also  knows  for  certain  what 
the  physical  appearance  of  the  object  was  in  what  was  once  the  present.  Hence, 
the  second  description,  that  of  the  object  in  the  year  2000,  i6  guaranteed  to 
be  correct  both  with  respect  to  the  object's  appearance  in  2000  and  the  object's 
appearance  in  what  was  once  the  present.  (The  one  exception  to  this  guarantee 
is  a  certain  problem  referred  to  as  "inconsistent,"  as  described  below.) 

To  summarize,  physical  descriptions,  which  carried  no  implications  for 
what  an  object  would  look  like  at  another  time,  were  always  accurate  in  all 
respects.  Verbal  descriptions,  which  did  carry  an  implication  for  the  appearance 
of  an  object  at  another  time,  were  always  accurate  with  respect  to  the  physical 
description  they  implied  for  the  object  at  the  time  at  which  the  description 
was  given  (except  for  inconsistent  items),  but  in  the  present,  they  might  not 
be  accurate  with  respect  to  the  physical  description  they  implied  for  the  year  2000. 

Some  examples  of  actual  items  will  illustrate  a  few  item  types.  (See  Sternberg, 
1981b,  1982,  for  further  examples.)  In  these  examples,  the  letters  G  and  B  are 
used  to  represent  the  colored  dots  (green  or  blue)  that  were  used  to  represent 
physical  appearances  in  the  actual  stimulus  items.  The  letter  I  stands  for  "incon¬ 
sistent."  Recall  that  items  could  consist  of  either  two  verbal  descriptions, 
a  physical  description  and  a  verbal  description,  a  verbal  description  and  a  physi¬ 
cal  description,  or  two  physical  descriptions. 

Blue  Blue  G  B  I 

In  this  example,  an  object  is  described  verbally  as  blue  in  the  present  and  as  blue 
in  2000.  Clearly,  its  physical  appearance  in  2000  is  B.  This  was  an  easy  item,  with 
a  mean  response  latency  of  1.5  second. 
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Blue  Green  1  B  G 

In  this  example,  an  object  is  described  verbally  as  blue  in  the  present  but  as  green 
in  2000.  These  two  items  of  information  are  inconsistent  with  each  other,  and  hence 
the  correct  answer  is  the  letter  I.  If  the  physical  appearance  of  the  object 
changes  from  blue  in  the  present  to  green  in  2000,  the  appropriate  verbal  descrip¬ 
tion  of  the  object  in  the  year  2000  is  bleen.  If  the  physical  appearance  of  the 
object  does  not  change,  the  appropriate  verbal  description  in  the  year  2000  is 
blue.  But  an  object  cannot  correctly  be  described  as  green  in  the  year  2000  if  its 
physical  appearance  was  formerly  blue.  This  item  was  moderately  difficult,  with 
a  mean  response  latency  of  2.5  seconds. 

G  Grue  G  B  I 

In  this  example,  an  object  is  described  as  physically  green  in  the  present  but  as 
verbally  grue  in  the  year  2000.  The  object  thus  must  have  appeared  physically  green 
in  the  present  and  physically  blue  in  2000.  The  correct  answer  is  B.  This  item 
was  also  moderately  difficult,  with  a  mean  solution  latency  of  3.1  seconds. 

Bleen  B  Green  Bleen  Blue 

In  this  example,  an  object  is  described  verbally  as  bleen  in  the  present  and  physi¬ 
cally  as  B  In  2000.  One  can  infer  that  its  physical  appearance  remained  in  2000  what 
it  was  in  the  present,  blue.  The  prediction  that  the  object  would  change  in  physical 
appearance  was  incorrect.  The  correct  answer  is  blue.  This  was  a  very  difficult 
item,  with  a  mean  solution  latency  of  4.3  seconds. 

B  G  Bleen  Green  Grue 

In  this  example,  an  object  is  described  physically  as  B  in  the  present  and  as  G  in 
2000.  The  correct  verbal  description  of  the  object  in  2000  is  bleen.  This  was  a 
difficult  item,  with  a  mean  solution  latency  of  3.5  seconds. 
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Consider  the  second  instantiation  of. the  projection  task,  which  was  seen  by 
subjects  different  from  those  participating  in  the  first  experiment  (instantiation). 
In  this  experiment,  based  on  appearances  of  objects  on  the  planet  Kyron,  an  object 
was  described  as  plin  if  it  appears  solid  north  of  the  equator  and  solid  south 
of  the  equator,  as  kvef  if  it  appears  liquid  north  of  the  equator  but  solid 
south  of  the  equator,  as  bait  if  it  appears  solid  north  of  the  equator  but  liquid 
south  of  the  equator,  and  as  pros  if  it  appears  liquid  north  of  the  equator  but 
solid  south  of  the  equator.  In  each  case,  subjects  were  told  that  knowledge 
about  the  object  was  obtained  first  regarding  its  state  north  of  the  equator 
and  then  regarding  its  state  south  of  the  equator.  Hence,  "north  of  the  equator" 
corresponds  to  "the  present"  in  the  first  experiment,  and  "south  of  the  equator" 
corresponds  to  "the  year  2000"  in  the  first  experiment.  Physical  representations 
of  objects  were  either  a  filled  dot  (for  solid  physical  appearance)  or  a  hollow  dot 
(for  liquid  physical  appearance) .  Two  experiments  were  conducted  with  this 
instantiation. 

In  the  third  instantiation  of  the  projection  task,  which  was  seen  by  still 
different  subjects,  the  same  four  new  words  were  again  used,  but  their  meanings 
were  different.  Four  types  of  persons  were  alleged  to  live  on  the  planet  Kyron. 

A  person  was  described  as  plin  if  the  person  was  born  a  child  and  remained  a  child 
throughout  his  or  her  life  span.  A  person  was  described  as  kwef  if  the  person  was 
born  an  adult  and  remained  an  adult  during  the  course  of  his  or  her  life  span. 

A  person  was  described  as  bait  if  the  person  was  born  a  child  but  became  an  adult 
during  the  course  of  his  or  her  life  span.  And  a  person  was  described  as  pros 
if  the  person  was  born  an  adult  but  became  a  child  during  the  course  of  his 
or  her  life  span.  A  stick  picture  of  a  little  person  was  used  for  the  physical 
representation  of  a  child;  a  stick  picture  of  a  big  person  was  used  for  the  physical 
representation  of  an  adult. 

Each  of  approximately  25  subjects  was  tested  in  each  of  four  experiments. 
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Testing  consisted  of  a  number  of  projection-task  items  followed  by  standardized 
tests  of  inductive  reasoning  ability  from  widely  used  tests  of  general  intelligence. 
Each  subject  in  each  experiment  saw  each  of  the  36  item  types  three  times,  once 
with  the  correct  answer  in  each  of  the  three  possible  ordinal  positions. 

Mean  latencies  were  3.02  seconds  for  the  green-blue  task,  5.44  and  3.89 
seconds  for  harder  and  simpler  versions  of  the  liquid-solid  task,  and  4.15  seconds 
for  the  child-adult  task.  These  means  differed  significantly.  Latencies  were 
highly  correlated  across  experiments  (using  item  types  as  observations) ,  suggesting 
similar  information  processing  in  all  three  instantiations.  An  information-pro¬ 
cessing  model  of  task  performance  accounted  for  .94,  .92,  .91,  and  .93  of  the 
2 

variance  (R  )  in  the  green-blue,  liquid-solid  (two  versions),  and  child-adult 
task,  respectively.  RMSDs  for  the  respective  tasks  were  .20,  .43,  .30,  and  .28 
seconds.  Residuals  were  significant  only  in  one  variant  of  the  second  (liquid- 
solid  task),  indicating  that  the  model  did  an  exceptionally  good  job  of  accounting 
for  the  task  latencies. 

The  most  interesting  results  in  this  experiment  were  correlations  of  global 
task  and  parameter  latencies  with  scores  on  the  inductive-reasoning  items  from 
the  IQ  tests.  Global  correlations  of  task  scores  with  psychometrically-derived 
scores  were  -.69,  -.77,  -.61,  and  -.48  for  the  four  respective  experiments.  These 
correlations  are  not  only  consistently  high  and  significant,  but  they  are  obviously 
higher  than  those  obtained  for  any  of  the  other  cognitive  tasks  currently  being 
studied  by  investigators  of  intelligence  in  their  laboratories.  The  correlations 
are  thus  consistent  with  the  notion  that  performance  on  nonentrenched  tasks  is 
related  to  intelligence  in  a  particularly  central  way.  Moreover,  patterns  of 
correlations  of  parameter  scores  with  the  psychometric  scores  made  good  sense. 
Parameters  representing  complex  conceptual  transformations  were  the  ones  that 
tended  to  be  responsible  for  the  higher  levels  of  correlation,  whereas  parameters 
representing  simpler  operations  were  only  trivially  correlated  with  psychometric 
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test  scores.  Thus,  the  componential  decomposition  of  task  performance  enabled 
me  to  extract  those  components  of  information  processing  that  were  critical  to 
"nonentrenched"  reasoning. 

Obviously,  the  situations  used  in  this  task  were  highly  artificial.  This 
is  something  that  1  plan  to  remedy  in  future  research,  using  situations  such 
as  changes  of  colors  of  leaves.  But  the  critical  finding  is  that  the  abilities 
required  to  perform  nonentrenched  tasks  successfully  and  to  process  nonentrenched 
concepts  within  such  tasks  appear  to  be  consequential  to  measuring  individual 
differences  in  intelligence.  These  experiments  must  therefore  be  seen  as  a  first- 
pass  at  this  kind  of  measurement  in  an  artificial  task  environment,  with  more 
realistic  task  environments  to  follow.  Certainly,  the  results  of  both  the 
modeling  and  the  correlations  hold  promise  for  further  exploration  of  the 
concept  of  nonentrenchment  in  intelligent  performance. 
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Summary .  To  summarize,  I  have  presented  a  componential  theory  of  analogi¬ 
cal  reasoning  processes  and  shown  how  it  can  be  generalized  to  other  kinds  of 
induction  problems,  namely,  series  completions,  classifications,  and  metaphors 
I  have  also  presented  theories  of  response  choice  in  causal  inference  and  of 
projection  of  nonentrenched  events.  In  the  next  section,  I  consider  the  natur 
of  fluid  abilities  that  are  deductive  rather  than  inductive  in  nature. 
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Fluid  Abilities :  Deduction 

Our  investigations  of  fluid  abilities  of  deductive  reasoning  have  led  us 
to  investigations  of  variants  of  three  types  of  syllogisms:  linear  syllogisms 
(Sternberg,  1980a,  1980b,  1980c;  Sternberg  &  Weil,  1980),  and  categorical  and 
conditional  syllogisms  (Guyote  &  Sternberg,  1981;  Sternberg  &  Turner,  1981). 
Examples  of  the  various  kinds  of  syllogisms  my  collaborators  and  I  have  studied 
are  shown  in  Table  2.  Consider  first  the  solution  of  linear  syllogisms  and  next 


Insert  Table  2  about  here 


the  solution  of  categorical  and  conditional  syllogisms.  Solution  of  the  latter 
two  kinds  of  syllogisms  can  be  accounted  for  by  a  single  theory,  but  a  theory 
different  frc-  that  accounting  for  the  solution  of  linear  syllogisms.  Because 
the  number  and  complexity  of  the  performance  components  used  to  solve  deduction 
problems  are  greater  than  for  induction  problems,  the  theories  will  be  described 
only  in  outline.  Details  are  contained  in  the  original  papers. 

Linear  syllogisms.  The  proposed  theory  of  linear  syllogistic  reasoning 
processes  attempts  to  account  for  linear  syllogistic  reasoning  in  terms  of  the 
operation  of  eight  distinguishable  performance  components.  These  components 
are  (a)  encoding,  which  includes  time  to  read  the  premises  and  to  combine  them 
into  a  spatial  array;  (b)  negation .  including  incremental  time  to  understand 
negated  propositions  (over  time  required  to  understand  affirmative  propositions); 
(c)  narking,  including  Incremental  time  to  understand  marked  adjectives  such 
as  shorter,  verse,  and  slower  (over  time  required  to  understand  unmarked 
adjectives  such  as  taller .  better,  and  faster) ;  (d)  pivot  search,  including 
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Sample  Syllogisms 

Linear  Syllogisms 


Sam 

is 

taller 

than 

Joe. 

B.  Sam 

is 

better  than  Joe 

Joe 

is 

taller 

than 

Tom. 

Sam 

is 

worse  than  Tom. 

Who 

is 

tallest 

Who 

is 

worst? 

Joe 

Tom 

Sam 

Tom 

Sam  Joe 

C.  Sam  Is  not  as  fast  as  Joe. 

Tom  Is  not  as  fast  as  Sam. 

Who  is  slowest? 

Joe  Sam  Tom 

[The  subject's  task  is  to  answer  the  question  with  one  of  the  options.] 


Categorical  Syllogisms 


A.  No  C  are  B. 
All  B  are  A. 


B.  No  cottages  are  skyscrapers. 
All  skyscrapers  are  buildings. 


All  A  are  C. 

No  A  are  C. 

Some  A  are  C. 

Some  A  are  not  C. 
None  of  the  above. 


All  buildings  are  cottages. 

No  buildings  are  cottages. 

Some  buildings  are  cottages. 

Some  buildings  are  not  cottages. 
None  of  the  above. 


C.  No  milk  cartons  are  containers. 
All  containers  are  trash  cans. 


All  trash  cans  are  milk  cartons. 

No  trash  cans  are  milk  cartons. 

Some  trash  cans  are  milk  cartons. 

Some  trash  cans  are  not  milk  cartons. 
None  of  the  above. 


D.  No  headphones  are  ducks. 
All  ducks  are  bottles. 


All  bottles  are  headphones. 

No  bottles  are  headphones. 

Some  bottles  are  headphones. 

Some  bottles  are  not  headphones. 
None  of  the  above. 


[The  subject's  task  is  to  choose  the  logically  correct  option.] 

Conditional  Syllogism 


If  A  then  B. 

Not  B.  /Therefore,  not  A. 


(The  aubject'a  task  is  to  indicate  whether  the  syllogism  is  logically  valid.) 
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time  to  locate  the  middle  term  of  the  three-item  linear  array  inferred  from 
the  terms  of  the  linear  syllogism;  (e)  response  search,  including  time  to 
locate  the  correct  response  to  the  question  asked  in  the  linear  syllogism; 

(f)  noncongruence,  including  time  to  rephrase  the  question  if  it  is  asked  in 
terms  of  an  adjective  different  from  that  in  which  the  correct  answer  to  the 
question  happened  to  be  encoded  (as  when  the  correct  answer  is  encoded  in  terms 
of  "tallness"  and  the  question  asks  who  is  "shortest;"  (g)  question  reading, 
including  the  time  to  read  the  question  posed  by  the  linear  syllogism;  and 
(h)  response ,  including  the  time  to  communicate  one's  perferred  answer. 

In  this  theory  as  in  the  theory'  of  inductive  reasoning,  it  is  assumed 
that  the  performance  components  are  executed  in  a  sequential  fashion.  It  is 
theorized  that  both  linguistic  and  spatial  forms  of  mental  representation  are 
used  at  various  points  in  the  sequence  of  information  processing.  In  particular, 
it  is  theorized  that  the  problem  is  first  linguistically  decoded  such  that  the 
presented  surface  structure  is  converted  to  a  linguistic  deep  structure  (see 
Clark,  1969),  and  then  spatially  recoded,  such  that  a  spatial  array  is  formed 
that  represents  the  linear  ordering  of  relations  among  the  terms  of  the  problem. 
Certain  components  are  theorized  to  act  upon  the  linguistic  representation  only 
(noncongruence,  question  reading),  others  upon  the  spatial  representation  only 
(pivot  search,  response  search),  and  others  upon  both  forms  of  representation 
(encoding,  negation,  marking).  The  response  component  does  not  act  upon  a 
representation,  but  merely  provides  a  means  of  communication. 

The  proposed  theory  of  linear  syllogistic  reasoning  was  tested  against 
several  alternative  theories:  a  purely  spatial  theory  (DeSoto,  London,  i 
Handel,  1965;  Huttenlocher ,  1968;  Huttenlocher  &  Higgins,  1971),  positing 


trivial  linguistic  processing  and  a  set  of  processes  acting  6olely  upon  a 
spatial  representation  for  information;  a  purely  linguistic  theory  (Clark, 

1969)  positing  a  set  of  processes  acting  solely  upon  a  linguistic  representation 
for  information;  a  spatial  to  linguistic  theory  (Johnson-Laird,  1972;  Wood, 
Shotter,  &  Godden,  1974)  positing  spatial  processing  when  one  first  encounters 
linear  syllogisms  and  linguistic  processing  only  later  after  some  practice, 
and  a  linguistic  to  spatial  theory  (Shaver,  Pierson,  &  Lang,  1974)  positing 
linguistic  processing  when  one  first  encounters  linear  syllogisms  and  spatial 
processing  only  later  on.  The  theories  were  compared  in  a  number  of  experiments- 
the  results  of  one  experiment  (Sternberg,  1980c , Experiment  3)  will  be  stressed 
here,  but  the  results  of  all  of  the  experiments  were  in  concordance. 

Subjects  in  the  experiment  were  18  Yale  undergraduates,  each  of  whom 
received  288  linear  syllogisms  using  the  adjective  pairs  taller-shorter,  better- 
worse,  and  faster-slower.  The  main  dependent  variable  was  response  time,  and 
independent  variables  were  formed  by  manipulating  aspects  of  item  structure 
so  as  to  make  it  possible  to  separate  the  proposed  components  of  information 
processing.  Items  were  administered  tachistoscopically ,  and  administration 
of  the  items  was  followed  up  by  the  administration  of  a  series  of  paper-and- 
pencll  psychometric  tests  of  linguistic  and  spatial  abilities. 

Mean  response  time  was  7.0  seconds  with  an  error  rate  of  .01.  The 
proposed  theory  accounted  for  .84  of  the  variance  in  the  group-mean  latency 
data,  with  an  RMSD  of  .38  seconds.  The  residual  (unaccounted  for)  variance 
was  statistically  significant,  Indicating  that  the  proposed  theory  was  not  the 
true  one.  However,  the  proposed  theory  did  do  a  superior  job  of  accounting  for 
the  data  relative  to  any  of  the  alternative  theories  considered.  This 
superiority  holds  up  across  age  levels  from  grade  3  to  college  (Sternberg, 

1980a). 
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The  patterns  of  correlation  between  global  task  and  parameter  scores,  on 
the  one  hand,  and  psychometric  test  scores,  on  the  other,  were  of  particular 
Interest  in  this  research  because  they  provided  an  additional  means  of 
distinguishing  among  the  alternative  theories.  In  particular,  my  mixture 
theory  predicted  significant  correlations  of  global  task  scores  with  both 
linguistic  and  spatial  ability  test  scores,  the  linguistic  theory  predicted 
a  significant  correlation  only  with  the  linguistic  ability  score,  the  spatial 
theory  predicted  a  significant  correlation  only  with  the  spatial  ability  score. 
The  strategy-change  theories  predicted  changing  patterns  of  correlation  with 
changes  in  level  of  practice.  The  individual  theories  also  made  specific 
predictions  regarding  patterns  of  correlation  for  specific  parameter  estimates 
with  the  ability  scores.  Correlations  to  be  presented  are  for  the  results  of 
four  experiments  combined  (Sternberg,  1980c). 

As  it  turned  out,  the  global  linear-syllogisms  task  score  correlated 
significantly  with  both  the  linguistic  and  spatial  ability  test  scores.  The 
obtained  correlations  were  -.46  and  -.56  with  linguistic  and  spatial  abilities 
respectively.  The  significance  and  approximate  levels  of  the  correlations  held 
up  over  practice,  thus  providing  no  support  for  the  strategy-change  theories. 
Moreover,  the  pattern  of  correlations  of  Individual  parameter  estimates  with 
the  test  scores  was  almost  exactly  that  predicted  by  the  mixture  theory,  with 
linguistic  parameters  correlating  significantly  with  the  linguistic  but  not 
with  the  spatial  tests,  spatial  parameters  correlating  significantly  with 
the  spatial  but  not  with  the  linguistic  tests,  mixed  parameters  correlating 
significantly  with  both  kinds  of  tests,  and  the  response  parameter  correlating 
significantly  with  neither  kind  of  test. 
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The  proposed  model  of  linear  syllogistic  reasoning  was  originally  proposed 
for  determinate  linear  syllogisms,  that  is,  ones  with  a  uniquely  determinable 
answer.  It  can  be  extended  to  indeterminate  problems,  however,  such  as  "John 
is  taller  than  Mary.  John  is  taller  than  Susan."  If  one  asks, "Who  is  tallest?" 
the  problem  is  readily  answerable.  But  if  one  asks,  "Who  is  shortest?"  the 
problem  is  not  readily  answerable,  since  insufficient  information  is  given 
to  determine  whether  Mary  or  Susan  is  the  shorter  one.  By  the  addition  of 
two  performance  components  to  the  proposed  mixture  theory,  the  theory  can  handle 
problems  of  this  kind.  In  an  experiment  to  test  the  augmented  theory,  the 
theory  accounted  for  .80  of  the  variance  in  determinate  problems  only,  .93 
of  the  variance  in  indeterminate  problems  only,  and  .89  of  the  variance  in 
the  two  kinds  of  problems  considered  together  (Sternberg,  1981c  ). 

Testing  of  my  proposed  theory  of  linear  syllogistic  reasoning  on  other 
data  sets  in  the  literature  revealed  a  curious  finding:  Although  most  data 
sets  seemed  to  support  my  own  theory,  a  few  supported  other  theories.  An 
experiment  was  performed  to  resolve  the  "curious  conflicts"  in  the  literature 
(Sternberg,  1980b) .  Eighteen  adult  subjects  received  linear  syllogisms  under 
instructions  designed  to  yield  speeds  commensurate  with  error  rates  of  about  10'.. 
Latency  and  error  data  were  analyzed  both  separately  (via  multiple  regression) 
and  jointly  (via  canonical  regression) .  The  data  were  also  analyzed  via  pseudo¬ 
deadlines,  according  to  which  responses  were  counted  as  correct  if  they  were  cor¬ 
rect  and  fell  below  a  given  pseudodeadline  and  were  counted  as  erroneous  if  they 
were  Incorrect  or  fell  above  a  given  pseudodeadline.  (This  procedure  mimicked 
procedures  used  by  others,  where  a  genuine  deadline  was  set  for  problem  completion.) 
The  analyses  revealed  the  source  of  conflicts  to  be  in  the  complex  interrelationships 
between  latency  and  error  rate.  In  particular,  the  mixture  theory  better  accounted 
for  the  data  when  these  relationships  were  taken  into  account. 


64 


The  data  analyses  I  have  presented  in  this  article  have  generally  sufficed 
to  identify  one  theory  as  better  than  its  competitors,  usually  in  the  sense 
that  the  squared  correlation  between  that  theory's  predictions  and  the  observed 
group-mean  data  is  higher  than  the  squared  correlations  between  the  predictions 
of  alternative  theories  and  the  observed  group-mean  data.  But  such  model 
fitting  only  describes  the  behavior  of  most  of  the  people  most  of  the  time, 
not  all  of  the  people  all  of  the  time. 

Sternberg  and  Weil  (1980)  investigated  the  range  of  variation  over  subjects 
in  strategy  use.  They  found  that  of  48  subjects  left  to  their  own  devices 
regarding  strategy  selection,  30  employed  a  mixture  strategy,  7  a  linguistic 
strategy,  5  a  spatial  strategy,  and  6  a  short-cut  strategy  that  bypassed  formal 
reasoning  processes.  (Strategy  use  was  determined  by  fitting  each  theory  to 
each  individual  subject's  data  and  by  fitting  the  data  to  each  theory.  A 
subject  was  identified  as  using  that  strategy  corresponding  to  the  theory  that 
best  fit  the  individual  subject’s  data.)  To  validate  this  individual  strategy 
assignment  further,  global  linear-syllogism  response  times  were  correlated  with 
scores  on  linguistic  and  spatial  ability  tests.  The  correlational  patterns 
were  exactly  as  predicted.  Respective  correlations  with  linguistic  and 
spatial  ability  were  -.45  and  -.27  in  the  mixture  strategy  group  (both 
statistically  significant),  -.76  and  -.28  in  the  linguistic-strategy  group 
(only  the  former  correlation  statistically  significant),  -.08  and  -.61  in 
the  spatial-strateev  group  (only  the  latter  correlation  statistically 
significant),  and  -.32  and *28  in  the  algorithmic  strategy  group  (only  the 
former  correlation  statistically  significant,  but  both  correlations  reduced 
in  magnitude,  presumably  because  of  the  bypassing  of  formal  reasoning). 


These  data  show  the  importance  of  considering  individual  as  well  as  group 
fits  of  theory  and  data  in  order  to  understand  the  full  range  of  variation 
ir.  human  intellectual  performance. 

Categorical  and  conditional  syllogisms .  Our  theory  of  categorical  and 
conditional  syllogistic  reasoning  (Guyote  &  Sternberg,  1981;  Sternberg  & 
Turner,  1981)  is  addressed  primarily  to  bases  for  choosing  responses. 
According  to  this  theory  and  others,  syllogistic  reasoning  can  be  divided 
into  four  sequential  stages:  encoding,  combination,  comparison,  and  response. 
Where  theories  differ  is  in  terms  of  what  happens  in  each  stage.  The  theory 
is  called  a  "transitive-chain"  theory  because  it  is  theorized  that  syllogisms 
are  solved  by  transitive  inferences  on  premise  representations. 

In  the  encoding  stage,  individuals  read  and  interpret  the  premises  (see 
Table  2).  According  to  our  "transitive-chain"  theory,  the  encoding  stage  is 
error-free,  that  is,  subjects  solving  categorical  and  conditional  syllogisms 
encode  the  premises  completely  and  correctly.  Although  this  is  obviously  a 
simplifying  assumption,  it  fits  our  data  quite  well,  and  the  addition  of 
parameters  into  our  theory  for  errors  in  encoding  does  little  to  change  fits 
of  the  theory  to  the  data. 

In  the  combination  stage,  individuals  combine  encoded  information  in  order 
to  deduce  relations  between  the  terms  presented  in  the  major  (first)  and  minor 
(second)  premise  of  the  syllogism.  It  is  assumed  that  errors  during  the 
combination  stage  are  due  to  limitations  of  working  memory.  In  other  words, 
individuals  find  that  the  number  of  possible  combinations  of  set  relations 
between  the  two  premises  (which  can  range  from  1  to  16  across  the  full  range 
of  types  of  syllogisms)  can  exceed  the  capacities  of  their  working  memories 
to  hold  all  these  relations.  We  assume  that  individuals  never  process  more 
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than  four  possible  set  relations,  but  that  they  may  process  fewer.  The 
probabilities  of  combining  exactly  one,  two,  three,  and  four  set  relations 
can  be  represented  by  four  parameters,  PI,  P2,  P3,  and  P4  respectively. 

An  example  may  help  concretize  the  nature  of  the  combination  process. 
Consider  the  sample  syllogism  of  Table  2,  "No  C  are  B.  All  B  are  A."  The 
premise,  "No  C  are  B,"  can  be  represented  in  only  one  way,  as  nonintersecting 
sets.  The  premise,  "All  B  are  A"  can  be  represented  in  two  ways,  as  completely 
coincident  sets,  A  identical  to  B,  and  as  B  functioning  as  a  subset  of  A. 

There  are  thus  1x2*2  ways  of  combining  the  premises  in  this  particular 
problem.  A  problem  with  the  fewest  possible  combinatorial  possibilities 
is  "No  C  are  B.  No  B  are  A,"  which  yields  1x1*1  possible  combination  of 
set  relations.  A  problem  with  the  most  possible  combinatorial  possibilities 
is  "Some  B  are  C.  Some  A  are  B,"  which  yields  4  x  4  *  16  possible  combinations 
of  set  relations. 

In  the  comparison  stage,  individuals  compare  the  mental  results  of  their 
combination  process  to  the  presented  answer  options  among  which  they  will  have 
to  choose.  Errors  are  theorized  to  be  attributable  to  three  causes.  One  cause 
is  a  bias  toward  selecting  a  conclusion  whose  atmosphere  matches  the  atmosphere 
of  the  premises,  where  atmosphere  is  defined  as  negative  if  there  is  at  least 
one  negated  premise  (e.g..  No  B  are  C)  and  as  particular  if  there  is  at  least 
one  particular  premise  (e.g..  Some  B  are  C).  A  second  cause  is  a  bias  toward 
conclusions  both  that  match  the  atmosphere  of  the  premises  and  that  are  more 
restrictive  in  their  claims  (e.g..  All  is  more  restrictive  than  Some) .  A  third 
cause  is  a  response  bias  toward  deciding  a  problem  is  Indeterminate  (l.e.,  no 
conclusion  follows)  when  mental  representations  of  the  premise  Information  do  not 
natch  in  certain  respects  (see  Guyote  &  Sternberg,  1961). 
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In  the  response  stage,  individuals  are  theorized  merely  to  communicate 
their  proposed  response.  This  stage  is  assumed  to  be  error-free.  To  the 
extent  that  individuals  occasionally  push  the  wrong  button  or  unintentionally 
mark  an  answer  other  than  the  selected  one,  the  predictors  of  the  model  will 
be  in  error. 

The  theory  of  response  choice  is  a  nonadditive  model,  although  it  was 
assumed  that  the  corresponding  processing  components  contributing  to  response 
latency  were  executed  in  a  sequential  fashion.  Our  proposed  representation 
of  information  and  the  proposed  rules  for  integrating  information  would  require 
more  space  to  explain  adequately  than  is  available  here  (but  see  Guyote  6. 
Sternberg,  1981).  Basically,  individuals  are  assumed  to  translate  relations 
between  terms  in  the  syllogistic  premises  into  a  propositional  form,  and  then 
to  combine  information  in  the  premises  according  to  two  simple  rules.  The 
derived  propositional  information  from  the  combination  stage  is  compared  against 
propositional  information  inherent  in  the  answer  options  in  order  to  select  a 
response. 

Subjects  in  our  experiments  were  Yale  undergraduates.  They  received  a 
variety  of  kinds  of  categorical  and  conditional  syllogisms,  as  shown  in  the 
examples  of  Table  2.  Not  only  did  item  formats  differ,  but  item  contents  in 
one  experiment  differed  as  well,  including  syllogisms  with  factual,  counter- 
factual,  and  anomalous  premises.  In  the  other  experiments,  abstract  content 
(letters  of  the  alphabet)  was  used  in  the  premises.  Both  response  choices 
and  response  times  served  as  primary  dependent  variable.  Independent 

variables  were  various  aspects  of  item  structure  experimentally  manipulated 
to  permit  isolation  of  the  various  performance  conroonentf .  of  syllogistic 
reasoning.  Subjects  were  tested  either  via  a  computer  terminal,  a  tachisto- 
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scope,  or  a  booklet  (paper-and-pencil) .  Following  the  experimental  task, 
subjects  in  sore  of  the  experiments  received  psychometric  tests  of  linguistic 
and  spatial  abilities. 

Data  from  Guyote  and  Sternberg  (1981)  (categorical  syllogisms  with  abstract 

premises,  such  as  the  first  example),  provide  good  support  for  the  proposed 

theory.  The  mean  response  time  was  39.5  seconds  for  categorical  syllogisms 

with  abstract  content  and  the  mean  error  rate  was  .43.  The  response-choice 

theory  accounted  for  .97  of  the  variance  in  the  response-choice  data,  with  an 

RMSD  of  .05.  The  residual  variance  was  statistically  significant,  despite  the 

2 

very  high  value  of  R  .  The  theory  accounted  for  .88  of  the  variance  in  the 
response-time  data,  with  an  RMSD  of  .29  second.  Again,  the  residual  variance 
was  statistically  significant. 

The  model  also  fared  quite  well  in  accounting  for  the  data  for  conditional 
syllogisms  with  abstract  content  in  the  premises,  for  example,  "If  A  then  B.  A. 
Is  B  a  valid  conclusion?"  The  mean  response  time  for  syllogisms  of  this  kind 
was  13.5  seconds,  and  the  mean  error  rate  was  .17.  The  proposed  theory 
accounted  for  .95  of  the  variance  in  the  data  with  an  RMSD  of  .10.  Again,  the 
residual  variance  was  statistically  significant.  For  response  time,  the  theory 
accounted  for  .84  of  the  variance  in  the  data,  with  an  RMSD  of  .29  second. 

The  residual  variance  was  significant. 

Where  possible,  the  predictions  of  an  alternative  theory,  that  of 
Erickson  (1974,  1978),  were  compared  to  the  predictors  of  our  own  theory. 

This  theory  differs  from  our  own  positing  in  that  errors  in  syllogistic 
reasoning  are  due  to  incomplete  information  processing  in  each  of  the 
encoding,  combination,  and  comparison  stages  of  syllogistic  reasoning. 
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The  response  biases  built  into  our  theory  do  not  appear  in  Erickson's  theory. 

For  those  experiments  where  comparison  of  the  two  theories  was  possible 
(Erickson's  theory  has  been  applied  only  to  response-choice  data  for  categorical 
syllogisms  with  two  statement  premises),  our  transitive-chain  theory  better 
accounted  for  the  data. 

Proportion  correct  scores  on  the  syllogistic  reasoning  task  were  correlated 
with  scores  on  the  psychometric  ability  tests.  Significant  correlations  were 
obtained  with  spatial  ability  scores  (.42  and  .54  for  categorical  and  conditional 
syllogisms,  respectively).  It  thus  appears  that  the  major  source  of  individual 
differences  in  categorical  and  conditional  syllogisms  is  in  operation  upon  a 
spatial  rather  than  upon  a  linguistic  mental  representation.  Since  the  components 
of  the  combination  stage  represent  limitations  in  operations  upon  a  mental 
representation  in  working  memory,  whereas  the  components  of  the  comparison 
stage  merely  represent  response  biases,  one  might  expect  the  source  of  the 
obtained  correlations  to  be  localized  in  the  combination  stage.  This  was  in 
fact  the  case. 

Summary .  To  summarize,  I  have  presented  two  componential  theories  of 
deductive  reasoning — a  mixture  theory  of  linear  syllogistic  reasoning  and  a 
transitive-chain  theory  of  categorical  and  conditional  syllogistic  reasoning. 

Two  different  theories  were  required  because  the  information-processing 
requirements  of  categorical  and  conditional  syllogisms  differ  considerably 
from  the  information-processing  requirements  of  linear  syllogisms.  Nevertheless, 
the  theories  have  certain  communalities ,  most  notably,  the  solution  of  problems 
by  transitive  inferences.  Another  important  communality  is  the  importance  of 
a  spatial  mental  representation  in  both  kinds  of  syllogistic  reasoning.  Our 
theories  predict  an  important  role  for  linguistic  Information  processing  as 
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well,  but  this  prediction  was  supported  only  for  the  linear  syllogisms, 
the  high  levels  of  fit  of  the  theories  to  data  provide  good  support  for 

i 

proposed  theories  of  information  processing. 


Overall, 

the 


i 


Conclusions 


1  have  presented  a  componential  view  of  human  intelligence  whereby  intelli¬ 
gence  is  understood  in  terms  of  the  functioning  of  various  kinds  of  information¬ 
processing  components,  and  in  terms  of  the  interactions  among  and  automatization 
of  these  components.  At  the  heart  of  the  componential  view  is  the  functioning 
of  the  metacomponents,  or  executive  processes,  which  directly  activate  and 
receive  feedback  from  the  other  kinds  of  components,  in  particular,  components 
of  performance,  acquisition,  retrieval,  and  transfer.  These  kinds  of  com¬ 
ponents  interact  indirectly  with  each  other  through  the  mediation  of  the 

s' 

metacomponents. 

The  componential  view  (or  metatheory)  of  human  intelligence  has  been 
instantiated  in  a  number  of  theories  of  intellectual  task  performance,  each 
of  which  I  view  as  a  subtheory  of  human  intelligence  considered  as  a  whole, 
integral  entity.  These  subtheories  deal  with  two  broad  classes  of  tasks, 
those  requiring  constellations  of  abilities  that  can  be  characterized  as  fluid 
and  those  requiring  constellations  of  abilities  that  can  be  characterized  as 
crystallized.  This  division  of  tasks  is  not  wholly  arbitrary,  in  that  it 
emerges  both  from  certain  psychometric  theories  of  intelligence  and  from 
an  analysis  of  people's  conceptions  of  intelligence. 

The  proposed  subtheories  of  intelligence  have  been  tested  via  the  use 
of  several  different  kinds  of  response  measures  (response  time,  response  choice, 
error  rate,  ratings)  and  on  a  fairly  wide  variety  of  item  contents  and  formats. 
Results  of  empirical  tests  of  the  task  theories  have  been  generally  quite  favor¬ 
able,  both  In  an  absolute  sense  and  in  comparison  to  alternative  theories,  al¬ 
though  none  of  the  theories  can  be  viewed  as  "true.”  In  each  case,  there  is 
statistically  significant  variance  in  the  data  that  is  not  accounted  for  by 


the  theory 
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Future  research  on  intelligence  faces  a  number  of  difficult  questions, 
among  them,  the  kinds  of  tasks  we  ought  to  be  studying  and  the  kinds  of  methods 
we  ought  to  be  using  to  study  them.  I  would  like  to  close  this  section  with 
some  final  remarks  addressing  each  of  these  issues  as  they  pertain  to  my  own 
program  of  research. 

Although  virtually  all  researchers  would  view  both  fluid  and  crystallized 
abilities  as  critical  in  the  functioning  of  adaptive  intelligence,  the  emphasis 
in  my  own  research  program  has  been  very  heavily  weighted  toward  the  investi¬ 
gation  of  components  of  fluid  abilities,  as  exemplified  by  the  reasoning  tasks 
considered  above.  A  high  priority  for  future  research,  therefore,  is  to  seek 
a  comparable  breadth  and  depth  of  understanding  for  tasks  drawing  upon  crys¬ 
tallized  abilities,  for  example,  vocabulary  and  reading  comprehension.  The 
emphasis  in  my  research  over  the  next  few  years,  in  terms  of  the  kinds  of  tasks 
to  be  studied,  will  be  upon  verbal  tasks. 

Although  the  componential  methodology  developed  in  my  previous  work  has 
been,  I  believe,  quite  highly  successful  and  flexible  in  studying  a  variety  of 
fluid-ability  tasks  and  even  some  more  verbally-oriented  ones,  it  almost  cer¬ 
tainly  will  need  supplementation  when  brought  to  bear  upon  the  verbal  skills 
required  in  academic  and  practical  functioning.  Recent  "cognitive-science" 
approaches  to  cognition,  emphasizing  as  they  do  the  representation  of  procedural 
and  declarative  knowledge,  seem  to  provide  the  kind  of  supplementation  my  largely 
process-oriented  methodology  would  need  for  the  investigation  of  verbal  per¬ 
formance.  The  emphasis  in  my  research  over  the  next  few  years,  in  terms  of 
methodology  for  studying  tasks,  will  be  to  integrate  componential  and  cognitive- 
scientific  methodologies  in  order  to  obtain  a  methodology  that  is  more  powerful 
than  either  methodology  taken  by  Itself.  (This  integration  represents  what  I 
see  as  the  next  major  step  for  componential  analysis,  which  originally  was  formu¬ 
lated  as  an  Integration  of  psychometric  and  information-processing  methodologies.) 
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1  Dr.  Mike  Posner 
Department  of  Psychology 
University  of  Oregon 
Eugene,  OR  97403 

1  Dr.  Fred  Reif 
Physics  Department 
University  of  California 
Berkeley,  CA  94720 

1  Dr.  Lauren  Resnick 
LRDC 

University  of  Pittsburgh 
3939  O'Hara  Street 
Pittsburgh,  PA  1521 

1  Mary  S.  Riley 

Program  in  Cognitive  Science 
Center  for  Human  Information  Processing 
University  of  California,  San  Diego 
La  Jolla,  CA  92093 
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1  Dr.  Andrew  M.  Rose 

American  Institutes  for  Research 
1055  Thomas  Jefferson  St.  NW 
Washington,  DC  20007 

1  Dr.  Ernst  Z.  Rothkopf 
Bell  Laboratories 
Murray  Hill,  NJ  0797* 

1  Dr.  William  B.  Rouse 
Georgia  Institute  of  Technology 
School  of  Industrial  &  Systems 
Engineering 
Atlanta,  GA  30332 

1  Dr.  David  Rum el hart 

Center  for  Human  Information  Processing 
Univ.  of  California,  San  Diego 
La  Jolla.  CA  92093 

1  PROF.  FUMIKO  SAMEJIMA 
DEPT.  OF  PSYCHOLOGY 
UNIVERSITY  OF  TENNESSEE 
KNOXVILLE.  TN  37916 

1  Dr.  Arthur  Samuel 
Yale  University 
Department  of  Psychology 
Box  11A,  Yale  Station 
New  Haven,  CT  06520 

1  Dr.  Walter  Schneider 
Psychology  Department 
603  E.  Daniel 
Champaign,  IL  61820 

1  Dr.  Alan  Schoenfeld 

Mathematics  and  Education 
The  University  of  Rochester 
Rochester,  NY  1*627 

1  DR.  ROBERT  J.  SEIDEL 

INSTRUCTIONAL  TECHNOLOGY  GROUP 
HUMRR0 

300  N.  WASHINGTON  ST. 

ALEXANDRIA,  VA  22314 

1  Dr.  David  Shucard 
Brain  Sciences  Labs 

National  Jewish  Hospital  Research  Cente 
National  Asthma  Center 
Denver,  CO  80206 


1  Dr.  Edward  E.  Smith 
Bolt  Beranek  &  Newman,  Inc. 

50  Moulton  Street 
Cambridge,  MA  02138 

1  Dr.  Richard  Snow 
School  of  Education 
Stanford  University 
Stanford,  CA  9*305 

1  Dr .  Kathryn  T.  Spoehr 
Psychology  Department 
Brown  University 
Providence,  RI  02912 

1  Dr.  Albert  Stevens 

Bolt  Beranek  &  Newman,  Inc. 

10  Moulton  St. 

Cambridge,  MA  02238 

1  DR.  PATRICK  SUPPES 
INSTITUTE  FOR  MATHEMATICAL  STUDIES  IN 
THE  SOCIAL  SCIENCES 
STANFORD  UNIVERSITY 
STANFORD,  CA  9*305 

1  Dr.  Kikumi  Tatsuoka 
Computer  Based  Education  Research  Lab 
252  Engineering  Research  Laboratory 
Urbana,  IL  61801 

1  Dr.  Perry  W.  Thorndyke 
Perceptronics,  Inc. 

5*5  Middlefield  Road,  Suite  1*0 
Menlo  Park,  CA  9*025 

1  Dr.  Douglas  Towne 

Univ.  of  So.  California 
Behavioral  Technology  Labs 
16*5  S.  Elena  Ave. 

Redondo  Beach,  CA  90277 

1  Dr.  Benton  J.  Underwood 
Dept,  of  Psychology 
Northwestern  University 
Evanston,  IL  60201 

1  DR.  GERSH0N  WELTMAN 
PERCEPTRONICS  INC. 

6271  VARIEL  AVE. 

WOODLAND  HILLS,  CA  91367 
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1  Dr.  Keith  T.  Wescourt 
Perceptronics,  Inc. 

5**5  Middlefield  Road,  Suite  1U0 
Menlo  Park,  CA  9**025 

1  DR.  SUSAN  E.  WHITELY 
PSYCHOLOGY  DEPARTMENT 
‘  UNIVERSITY  OF  KANSAS 
Lawrence,  KS  66045 

•  1  William  B.  Whitten 
Bell  Laboratories 
2D-610 

Holmdel ,  NJ  07733 

1  Dr.  Christopher  Wickens 
Department  of  Psychology 
University  of  Illinois 
Champaign,  IL  61820 


